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Preface to the Third Edition 



This book is a revised and expanded version of the second edition of Space 
Physics. The first part introduces basic concepts and formalisms which are 
used in almost all branches of space physics. The second part is concerned 
with the application of these concepts to plasmas in space and in the helio- 
sphere. More specialized concepts, such as collisionless shocks and particle 
acceleration, are also introduced. The third part deals with methodological 
considerations. It consists of an expanded chapter on space measurement 
methods and a new chapter on general methodological problems. This last 
chapter is relevant in that it points out the differences between laboratory 
physics and physics in a complex natural environment, in particular the prob- 
lems of limited knowledge - or as Pollack [415] puts it, “Uncertain Science 
. . . Uncertain World” . In Part II, in most chapters a section “What I Did Not 
Tell You” has been added - it should help the reader to understand some 
crucial assumptions underlying the basic ideas introduced in the text and 
might help you to appreciate the limitations of our knowledge and our mod- 
els. These sections also give illustrative examples that help to understand the 
last chapter. 

This edition has also been expanded by numerous examples, in particu- 
lar in Part I. They illustrate basic concepts and aid the reader in the ap- 
plication of these concepts to real problems. In addition, new results from 
recent space missions, such as ACE, TP. ACE, and Wind, have been added. 
In the appendix, a list of Internet resources has been added. This list can 
also be found (in a “clickable” version) at www.physik.imi-osnabrueck. 
de/sotere/spacebook/intro .html. On that page, supplementary material 
to this course can be found, too. 

The idea of the book is an introduction to many aspects of space plasmas. 
Obviously, this approach has the disadvantage that a specialist in any of the 
subfields will be disappointed that his or her field is dealt with in only a brief 
and very elementary way. That is, without doubt, true. My idea, however, 
is to introduce the basic concepts to the novice not already specialized in 
any field and to help the specialist to easily grasp some ideas in other fields. 
Therefore the focus is on concepts rather than on detailed mathematical 
analysis. References should help both the novice who is looking for a deeper 
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Preface 



formal treatment and the specialist who wants to find reviews giving more 
details. 

There are also a few good books on plasma physics and/or space physics 
which can be recommended to the reader. A very accessible, unmatched in its 
style and consciencious approach, book on plasma physics is Plasma Physics 
and Controlled Fusion by F.F. Chen [97]; a well- written and up-to-date ac- 
count of the phenomena in space plasmas is given in Introduction to Space 
Physics^ edited by M.G. Kivelson and C.T. Russell [290]. These two books 
cannot be matched by the present one and can serve as valuable supplements. 
More formal introductions to plasma physics are Plasma Dynamics by R.O. 
Dendy [128] and Plasma Physics by R.J. Goldston and P.H. Rutherford [192]. 
Very good introductions to plasma physics of the kind required by a space sci- 
entist are given in Physics of Space Plasmas by G.K. Parks [397], Physics of 
Solar System Plasmas by T.E. Cravens [113], and Basic Space Plasma Physics 
by W. Baumjohann and R.A. Treumann [36] and its sequel Advanced Space 
Plasma Physics [520] . A useful collection of plasma formulas can be found at 
wwwppd . nrl . navy . mil/nrlf ormulary/ nrlf ormulary . html. 

As in the earlier editions, symbols in the margin help to guide you through 
the text. The symbols are 

• This section contains an example from space plasmas to illustrate a physical 
concept. Such a section might be skipped by a reader who is interested 
primarily in the concepts and less in space science. 

• This section is more formal, but is not vital for an understanding of basic 
observations and ideas. It might be skipped by a reader who is mainly 
interested in an introduction to space physics. 

• An apparently confused reader, “Whatnow”, marks supplementary sec- 
tions: although the ideas presented here are important in space physics, the 
theoretical background is complicated and only briefly sketched. In partic- 
ular, the beginner in space physics should feel free to skip these sections 
on first reading and return to them later after becoming more acquainted 
with the topic. 

• This text points to hotly debated topics and fundamental open problems. 

I am grateful to the following persons, who all contributed to the devel- 
opment of this book: Andre Balogh, R.A. Cairns, Stanley H. Cowley, Ulrich 
Fischer, Roman Hatzky, Bernd Heber, Eberhard Moebius, Reinhold Miiller- 
Mellin, Constantinos Paizes, Wilfried Schroder, Gunter Virkus, C.L. Waters, 
Gerd Wibberenz, and even an anonymous reader who sent hints about errors 
without being traceable. I am grateful to the helpful team at Springer, in 
particular Claus Ascheron, Adelheid Duhm, Gertrud Dimler, Ian Mulvany, 
and Frank Holzwarth. And - last but not least - a big thank-you to Klaus 
Betzler. 



Osnabriick, December 2003 



May-Britt Kallenrode 
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Part I 



Plasmas: The Basics 




1 Introduction 



We shall not cease from exploration. 
And the end of all our exploring 
will be to arrive where we started 
and know the place for the first time. 

T.S. Eliot, Little Gidding 



1.1 Neutral Gases and Plasmas 

Matter, in our daily experience, can be divided into solids, liquids, and gases. 
Manipulation of matter has shaped our scientific world view as well as our 
intuitive understanding of its different states and their behavior. But mov- 
ing upwards from the surface of the Earth, our environment changes and 
no longer fits into this picture: starting at a height of about 80 km, the 
atmosphere contains an ionized particle component, the ionosphere. With in- 
creasing height, the relative importance of the neutral component decreases 
and ionized matter becomes dominant. Farther out in the magnetosphere and 
in interplanetary space almost all gas is ionized: the hard electromagnetic ra- 
diation from the Sun immediately ionizes almost all matter. Space therefore 
is dominated by a plasma, the “fourth state of matter” . 

A plasma differs from a neutral gas in so far as it (also) contains charged 
particles. The number of charged particles is large enough to allow for elec- 
tromagnetic interactions. In addition, the number of positive and negative 
charges is nearly equal, a property which is called quasi-neutrality: viewed 
from the outside the plasma appears to be electrically neutral. The reason for 
this quasi-neutrality can be understood from the electrostatic forces between 
charged particles. For instance, in a gas discharge a typical length scale is 
L = 0.01 m and a typical number density number density of the electron gas 
is Tie = 10^° m“^. The electric field on the surface of a sphere with r = L 
containing only the electron gas but no ions is then E lO^o V/m. Such 
a strong field will immediately cause a rearrangement of charges and quasi- 
neutrality will be restored. In the rarefied plasmas in space, number densities 
are smaller by many orders of magnitude (see Fig. 1.1); however, since the 
spatial scales are measured in kilometers or even thousands of kilometers. 
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the same argument can be applied: on the relevant spatial scales the plasma 
is quasi-neutral even in the rarefied plasmas in space, although this is not 
necessarily the case on the centimeter scale. 

Because a plasma (partly) consists of free charges, it is a conductor. Mov- 
ing electric charges are currents. These currents induce magnetic fields which 
in turn infiuence the motion of the very particles forming the field-generating 
currents. Thus the particle motion in a plasma is not only controlled by ex- 
ternal electric and magnetic fields, but also creates fields which add to the 
external ones and modify the motion of the particles: a plasma can interact 
with itself. Consequently, dynamics in a plasma are more complex than in a 
neutral gas. This is most obvious in the large number of different types of 
plasma waves (Chap. 4). 

In apparently simple situations, a plasma can behave counter-intuitively. 
Pouring milk into our coffee, we expect the milk to heat up and mix with the 
coffee. A sunspot is a sharply bordered volume of cool gas embedded in the 
hot solar photosphere; but it stays stable for several months prevented by 
strong magnetic fields from warming or mixing with its environment. A cold 
and dense volume of gas or liquid in a hot environment sinks. A solar filament 
is cold and dense compared with the ambient corona but it is held in position 
against gravity by strong magnetic fields. Such discrepancies between our 
daily experience and the behavior of ionized gases clearly show that plasmas 
do not form a significant part of our environment. Why then do we study 
such exotic phenomena? Are there applications for plasmas? 

First, plasmas are not exotic but quite common. The interplanetary and 
interstellar medium and the stars are made of ionized gases. Thus about 
99% of matter in the universe is plasma. Nearest regions dominated by plas- 
mas are the magnetosphere with its radiation belts, the ionosphere, lightning 
bolts in the atmosphere, and, in a wider sense, the Earth’s core; thus even 
in the system Earth plasmas are not uncommon. Plasma physics, therefore, 
contributes to the understanding of our environment. In turn, the natural 
plasma laboratories, i.e. the ionosphere, the magnetosphere, and interplane- 
tary space, help to test the concepts of plasma physics on spatial scales and 
at densities unattainable in a laboratory. 

Even some everyday materials can be described as plasmas because they 
show similarities to the free-electron plasma described above: the conduction 
electrons in metals and electron-hole pairs in semiconductors are charges 
which can move quasi-freely and lead to a behavior of the matter which can 
be described in the same way as for a plasma. The free-electron gas in metals 
is therefore also included as example of a plasma in Fig. 1.1. 

Second, plasmas can be used for quite worldly applications. One of the 
most ambitious projects is nuclear fusion: to merge hydrogen atoms to helium, 
imitating the processes inside the Sun (Sect. 6.1) and the stars, in order to 
create a clean and long-lasting power source. The main aspects of this project 
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are the production of a plasma with suitable properties (density, temperature, 
losses) and its confinement inside a magnetic field. 

There are also less spectacular applications of plasma physics. Chemistry 
utilizes the different chemical reactions in plasmas and neutral gases: for 
instance, cyan gas can be synthesized by burning coal dust in a nitrogen 
electric arc plasma. Plasma beams are used for ion implantation in microchip 
production. Plasma burners and pistols are used to cut, weld, or clean metals. 
Other technical applications of plasmas are as diverse as lasers, capacitors, 
oscillators, and particle accelerators [323]. 



1.2 Characterization of a Plasma 

The main characteristics of a plasma are its electron temperature Tq and the 
electron number density rig. The first gives a meetsure of the thermal energy or 
more correctly the average kinetic energy of the particles (see Sect. 5.1): E'th = 
mt;th/2 = 3fcBTe/2, where A:b is Boltzmann’s constant. The temperature is 
often given in the units of particle energy, electronvolts (eV), where Te[eV] = 
3£’th/3[eV]. The temperature and number density are given for the electron 
component: whereas ions, owing to their larger mass, are rather immobile 
and on many occasions can be regarded as a fixed background of positive 
charges, the electrons are the mobile part of a plasma. 

The second parameter, the electron number density rig, is an indicator of 
the particle motion. In a low density plasma, particle motion is determined 
by the electric and magnetic fields only, while for high densities the inter- 
actions between the particles dominate. Thus the two parameters Tg and 
combined classify a plasma with respect to (a) interactions between plasma 
constituents, (b) the relative importance of electromagnetic fields for the par- 
ticle motion, and (c) the range over which particles can propagate freely. 

Figure 1.1 shows such an rig/Tg diagram. Some typical plasmas are in- 
dicated. Note that both parameters extend over many orders of magnitude. 
Astrophysical plasmas can be found anywhere in this diagram: the rarefied 
ionospheric plasma has a rather low temperature, while the rarefied plasmas 
in the magnetosphere and in the solar wind have much higher temperatures. 
All three have densities far below those in terrestrial plasmas such as gas 
discharges and lightning. Other astrophysical objects, such as the interior of 
the Sun, have higher densities and temperatures. The only terrestrial plasmas 
with comparable or higher temperatures are fusion plasmas. 

The dotted and dashed lines in Fig. 1.1 indicate two additional plasma pa- 
rameters: the Debye length Ad and the number A^d of particles inside a sphere 
of radius Ad- The Debye length gives the spatial scale over which particles 
in a plasma exert electrostatic forces on each other (Sect. 3.7). Ad increases 
with decreasing density because in a dense plasma charges of opposite sign 
screen each other, and it increases with increasing temperature because, ow- 
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Log Electron Number Density (l/m^) 




Log Electron Number Density (1/m^) 

Fig. 1.1. Characteristics of a plasma. (Top) Electron temperature Te, electron 
number density rie, Debye length Ad, and number Nd of particles inside a sphere 
of radius Ad, for diflPerent plasmas. (Bottom) Definition of different plasmas using 
the characteristic energies 



ing to the increased thermal motion of the particles, quasi-neutrality can be 
violated on a larger spatial scale. 

An Uq/Tq diagram can also be used for a general classification of plasmas 
as shown in the lower panel of Fig. 1.1. Five characteristic energies provide the 
reference frame for classification: the thermal energy E'th, the non-relativistic 
Fermi energy the electrostatic energy FJei, the energy of the ground state 
F^bo? aiid the relativistic electron energy E’rei. Since these energies depend 
on the particle species under study, we shall not discuss the characteristic 
energies in general but shall do so only for the example of hydrogen (Z = 1). 
This is also the most common element in space plasmas. 

The first characteristic energy is given by a thermal energy equal to that 
of the ground state, that is, the ionization energy: Eth = F^bo- This charac- 
teristic energy is marked by the lower horizontal line in the lower panel of 
Fig. 1.1: above this line the plasma is (fully) ionized, below it is (almost) 
neutral. This characteristic energy therefore divides neutral gases from ideal 
plasmas. Note that this is a very simple description for two reasons, (a) Cer- 
tainly, there will be no sharp boundary between neutral and fully ionized, and 
even for a given temperature there will be stochastic variations in the degree 
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of ionization, (b) The ionization also depends on the density, as described by 
the Saha equation: 



_ (27rme)^/^ f Ebo ] 

X - 1 Pgas I ksT j ’ 



( 1 . 1 ) 



where x is the degree of ionization, rrie is the electron mass, pgas is the total 
gas pressure, and h is Planck’s constant. A high degree of ionization might be 
obtained even if the temperature was only 1/10 of the ionization temperature. 
Nonetheless, since we are mainly concerned with rarefied plasmas at rather 
high temperatures, the above distinction is sufficient for the purpose of this 
book. 

If we increase the temperature further, we obtain a characteristic energy 
where the thermal energy equals the electron’s relativistic energy: E'th = ^rei- 
This is indicated by the upper horizontal line in Fig. 1.1. The plasma above 
this line is said to be relativistic. Here the scattering of photons from electrons 
has to be described as Compton scattering, and the equilibrium radiation field 
has enough energy to allow pair production. 

With increasing density, the plasma degenerates: the energy distribution 
is no longer Maxwellian but is described by a distribution in which all phase 
space cells up to the Fermi energy Ep are filled, while at higher energies the 
population density decreases rapidly. This characteristic energy is indicated 
by a solid inclined line and separates degenerate and non-degenerate plasmas. 

The last characteristic energy relates the thermal energy to the electro- 
static energy: Eth — Eq\. Plasmas to the left of the corresponding charac- 
teristic line are ideal: here the kinetic energy of a particle is larger than its 
potential energy. In the non-ideal plasmas to the right of the characteristic 
line the electrostatic interaction is predominant. 



1.3 Plasmas in Space 

This book focuses on natural plasmas in space. Depending on their location, 
these plasmas exhibit different properties as characterized by the plasma 
parameters (see Fig. 1.1); all space plasmas, except for stellar interiors, can be 
characterized as ideal plasmas. Stellar interiors consist of hot and extremely 
dense plasmas with the highest densities inside white dwarfs; these plasmas 
are ideal but degenerate. Plasma density and temperature decrease in the 
stellar coronae which are still hot enough to be blown away as stellar winds. 
The combined action of the stellar wind and the magnetic field slows down 
the star’s rotation and winds up the magnetic field lines. This “starsphere” 
is a void in space, filled by plasma and magnetic fiux from the star. The 
interstellar medium, which fills the space between the starspheres, is most 
likely an even more attenuated plasma than stellar winds are. 

The spatially closest example for such a starsphere is the heliosphere 
(Chap. 6), see Fig. 1.2: a void in the interstellar medium structured by the 
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solar wind and the frozen-in solar magnetic field. It is separated from the 
interstellar medium by the heliopause. The heliosphere has an extent of at 
least about 100 AU^. Voids in the heliosphere exist too: the interaction be- 
tween the solar wind and a planetary magnetic field forms a magnetosphere: 
a cavity in the solar wind, dominated by the planet’s magnetic field. 

The topology of the magnetosphere (Chap. 8), is even more complex; 
however, the physical processes, although on smaller spatial scales, are the 
same. A magnetosphere is defined as a spatial region where the motion of 
particles is governed by the planet’s magnetic field. ^ This brief definition 
contains a lot of information. We learn the obvious: the very existence of the 
magnetosphere requires a magnetic field. Particles in the magnetosphere are 
at least partly charged: the motion of neutrals would not be infiuenced by the 
magnetic field. Their density is low: in a dense medium, collisions between the 
particles would determine their motion, and the infiuence of the magnetic field 
would be negligible. In addition, the energy density of the charged particles 
is small compared with the energy density of the field: otherwise the particle 
motion would distort the field instead of the field guiding the particles. 

The inner boundary of the magnetosphere is determined by the density: 
getting closer to the planet, the density increases. Brownian motion becomes 
dominant and the magnetic field no longer guides the particles. In the Earth’s 
magnetosphere this transition happens at a height of a few hundred kilome- 
ters. The upper ionosphere, extending to a height of about 1000 km, lies 
well inside the magnetosphere. The outer boundary of the magnetosphere is 
the magnetopause. It separates the planetary and the interplanetary mag- 

^ AU is short for astronomical unit which is the average distance between the Sun 
and the Earth: 1 AU = 149.6 • 10® km. 

^ With substituting “planet’s” by “Sun’s” or “star’s” we could use this statement 
as a definition for the heliosphere or a starsphere, too. 
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Fig. 1.3. Height scales in the near-Earth environment 



netic fields and prevents most of the solar wind plasma from entering the 
Earth’s magnetosphere and atmosphere. Moving towards the Sun, we would 
detect this boundary at a distance of about 10 Earth’s radii. In the opposite 
direction, the magnetosphere extends far beyond the orbit of the moon. 

Three different height regimes are summarized in Fig. 1.3. Regimes can 
be distinguished according to charge or “mixedness”. Below about 80 km, 
the atmosphere is completely neutral. In the ionosphere, the relative number 
of ionized particles increases with height. Below 100 km, in the homosphere, 
particles collide frequently. Thus the different atmospheric constituents are 
mixed thoroughly. In the heterosphere, above 100 km, particle motion is still 
dominated by collisions but different constituents start to separate, the degree 
of ionization increases, and neutrals are atomic rather than molecular. Above 
500 km, in the magnetosphere, collisions are infrequent, particles are charged, 
and particle motion is determined by the magnetic field. 

The main components of a magnetosphere are summarized in Fig. 1.4. 
The magnetosphere is formed by the interaction between the solar wind and 
the geomagnetic field. A boundary sheet, the magnetopause, forms where 
the pressure of the solar wind equals the magnetic field pressure. The solar 
wind streams around the magnetopause but it does not penetrate into the 
magnetosphere. Where the supersonic solar wind is slowed down to subsonic 
speed, the bow shock develops. At high latitudes, polar cusps form, separat- 
ing closed magnetic field lines in front of the magnetosphere from open field 
lines pulled away to the magnetotail by the solar wind. At these cusps, par- 
ticles and plasmas can penetrate into the magnetosphere and subsequently 
precipitate down into the atmosphere. 

The basic ingredient of the magnetosphere, the geomagnetic field, origi- 
nates in a magnetohydrodynamic (MHD) dynamo in the ionized fiuid inside 
the Earth’s core. A similar process can be found inside other planets; solar 
and stellar magnetic fields originate in dynamo processes too. In the solar 
system all planets except for our two neighbors. Mars and Venus, house a 
sufficiently strong MHD dynamo to build a planetary magnetic field and to 
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Fig. 1.4. Structure 
of the Earth’s mag- 
netosphere. Reprinted 
from H. Rosenbauer 
et ah, [447], J. Geo- 
phys. Res. 80, Copy- 
right 1975, American 
Geophysical Union 



form a magnetosphere. But even planets without magnetospheres and comets 
are shielded against the solar wind: when the supersonic solar wind hits their 
atmospheres, it is slowed down and deflected around the obstacle, forming a 
bow shock in front of it. 

Magnetospheres are not stationary but change as the basic ingredients, the 
planetary magnetic held and the solar wind, vary in time. Disturbances in the 
solar wind, caused either by temporal and spatial variations or by transient 
phenomena, shake the magnetosphere and lead to geomagnetic storms and 
aurorae. When the terrestrial magnetic held varies in strength, the spatial 
extent of the magnetosphere changes too. And when the dipole axis drifts, 
the structure of the magnetosphere is modifled. All these different modes of 
magnetospheres can be observed in the solar system (Chap. 9). 

Space physics is not only concerned with plasmas and fields but also with 
energetic particles (Chap. 7). Their energy by far exceeds the kinetic energy 
of plasma particles although, owing to its larger density, the energy den- 
sity of the plasma exceeds that of the energetic particles. An understanding 
of the acceleration and propagation of these particles also is an important 
topic. For instance, fluctuations in plasma motions on the Sun, interactions 
of different plasma streams, and plasma clouds ejected from the Sun excite 
different kinds of waves. Some of these waves steepen during their propa- 
gation, forming shock waves. Energetic particles of solar and galactic origin 
interact with these waves: this results in spatial scattering as well as scat- 
tering in momentum space. These wave-particle interactions are of foremost 
interest in understanding space plasmas; however, they are also formally dif- 
flcult because they delve deeply into non-linear processes. 
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Plasmas and particles also influence the terrestrial environment. For 
instance, geomagnetic disturbances related to the arrival of large plasma 
clouds ejected from the Sun (coronal mass ejections) disrupt power and com- 
puter lines; energetic particles from a solar flare can ionize the atmosphere 
and reduce the ozone concentration. Such questions are addressed in solar- 
terrestrial relationships (Chap. 10), also called “space weather” for short. 



1.4 A Brief History of Space Research 

In situ observations of plasmas and particles in the magnetosphere and in in- 
terplanetary space became possible with the advance of satellite technology 
in the late 1950s and early 1960s. Many of these observations are discussed 
within the broad topic of solar-terrestrial relationships. Solar-terrestrial re- 
lationships is an old field of science; it dates back to the first correlations 
between sunspots and aurora in ancient China a few thousand years ago. 
Aurorae are the prime example of solar-terrestrial relationships: they can 
be detected easily, even with the naked eye; they are closely correlated with 
solar activity; and they also have an aesthetic and even mythological quality. 
Nonetheless, the big steps in understanding solar-terrestrial relationships re- 
quired observations made on board rockets and satellites: measurements of 
plasmas and particles in the ionosphere, the magnetosphere, and interplan- 
etary space, and the measurement of the Sun’s electromagnetic radiation in 
frequency ranges not observable from the ground. 

The solar wind and the magnetosphere act as coupling devices in solar- 
terrestrial relationships. Both have been studied, though only indirectly, long 
before the space age. Magnetism was detected more than 2000 years ago 
when the ancient Greeks found stones that attracted iron. The first reliable 
description of a compass dates back to the eleventh century when Shon-Kau 
(1030-1093) wrote in a Chinese encyclopedia: “fortune-tellers rub the point of 
a needle with the stone of the magnet in order to make it properly indicate 
the south”. The first written account of a compass in Europe dates back 
to Alexander Neekam (1157-1217), a monk at St. Albans. He describes the 
compass and its application in navigation as common. Neekam’s compass is a 
second generation instrument and quite similar to the ones used today: while 
in the first compass a small piece of magnetic stone floated in water on a 
piece of wood or cork, in Neekam’s compass a needle is placed on a pivot, 
allowing it to rotate freely and align itself along the north-south direction. In 
the fourteenth century the compass was common on ships. The declination, 
the difference between magnetic and geographic north, was well known by 
the early fifteenth century [94]; its temporal variation was reported in 1634 
by Henry Gellibrand (1597-1636). Magnetic inclination was discovered in 
the second half of the sixteenth century independently by Georg Hartmann 
(1489-1564) and Robert Norman. 
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The reason for the north-south-directivity of the compass needle was less 
well understood. Philosophers of the early thirteenth century suggested some 
connection by virtue between the loadstone used to rub the compass needle 
and the polar star - the latter being a special star since, unlike other stars, 
it is fixed. Later that century, the idea of polar loadstone mountains was 
proposed. Again, the polar star was believed to give his virtue to the loadstone 
mountains which in turn imparted it to the compass needle leading it to point 
towards the polar star. This idea was questioned by Petrus Peregrinus (Peter 
the Wayfarer, ca. 1240-?): loadstone deposits can be found in many parts of 
the world. Why should the polar ones be the only ones that attract a compass 
needle? Peregrinus attacked this question in a manner which can be termed 
scientific by present day standards: he performed experiments with loadstone, 
reported in his Epistola de Magnete. In particular, he introduced the concept 
of polarity, discovered magnetic meridians, and described different methods 
to determine the positions of the poles of a spherical loadstone. 

But only in 1600, the basic ingredient of the magnetosphere, the geomag- 
netic field, was detected: in his treatise De Magnete [185], William Gilbert 
(1544-1604) suggested that the north-south alignment of the compass results 
from the magnetic field of the Earth. In the middle of the nineteenth century, 
scientists began to understand the terrestrial magnetic field. A global net- 
work of observatories started continuous registrations of the magnetic field 
and its fiuctuations. Prom these data, Carl Friedrich Gauss (1777-1855) pro- 
posed that the Earth’s magnetic field consists of two components, one from 
its interior and a second one generated in the atmosphere [182]. To first order, 
the internal geomagnetic field can be described as that of a homogeneously 
magnetized sphere. Hans Christian 0rstaed (1777-1851) and Andre Ampere 
(1775-1836) suggested ring currents inside the Earth as source of the internal 
field. This dipole-field approach survived up to the early 1960s when satellite 
and rocket observations in the upper atmosphere gave a more detailed picture 
of the true field and suggested modifications to the model. 

Gauss and Wilhelm Eduard Weber (1804-1891) initiated very precise 
measurements of the Earth’s magnetic field with relative accuracies of at 
least 10“^ [553,554]. Small fiuctuations could be detected, showing systematic 
variations in time and location as well as superimposed stochastic changes. 
These latter strongly indicated that the Earth is not an isolated object in 
space but that strong forces from outside act on spaceship Earth. 

In its heyday in the second half of the nineteenth century, solar-terrestrial 
relationships were an illustrative example of the development of science from 
correlations and apparently uncorrelated, sometimes even seemingly contra- 
dictory observations into a consistent picture of a complex environment. The 
converging developments included, for example, the discovery of the 11-year 
sunspot cycle by Heinrich Samuel Schwabe (1789-1875) in 1844 [470, 471] 
and the correlation between sunspot numbers and the frequency of geomag- 
netic disturbances by Edward Sabine (1788-1883) in 1852. In the same year. 
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Rudolf Wolf (1816-1893) found a correlation between sunspots and geomag- 
netic disturbances. A relationship between individual aurorae and accompa- 
nying geomagnetic disturbances had already been noticed by Anders Celsius 
(1701-1744) and Olof Peter Hiorter (1696-1750) in 1747 [88, 223] and by 
Alexander von Humboldt (1769-1859) in 1806 [243]. The spatial distribution 
of aurorae suggested an involvement of the geomagnetic field, too. Aurorae 
always were known as a high-latitude phenomenon. But in the 1840s the ill- 
fated Arctic explorer John Franklin (1786-1847) noticed that the frequency 
of aurorae does not increase all the way towards the pole [169]. In 1860, Elias 
Loomis (1811-1889) showed that the highest incidence of aurora is seen inside 
an oval of 20°-25° around the magnetic pole [328]. In 1881 Hermann Fritz 
(1830-1883) [173] published similar results with his famous map of isochasms 
(see Fig. 8.43). 

During some ten years, out of these observations and statistical correla- 
tions a closed picture of solar-terrestrial relationships emerged with the Sun 
as a source of geomagnetic activity as well as aurorae. In the late 1870s, Henri 
Becquerel (1852-1908) offered the first physical explanation: the sunspots are 
assumed to be a source of fast protons [45]. On hitting the Earth’s magnetic 
field, these particles are guided towards the auroral oval by the magnetic 
field. Despite its simplicity, the model contains a revolutionary aspect: the 
Sun is not only a source of electromagnetic radiation but its blemishes, the 
sunspots, are also a source of energetic particles which could affect the terres- 
trial environment. In the early twentieth century, a similar idea led Kristian 
Birkeland (1867-1917) to build the terrella, a model of the Earth which allows 
simulations of the aurora in the laboratory: a cathode-ray tube substitutes 
for the Sun as a source of energetic particles and a magnetic dipole inside a 
sphere covered by a fluorescent material simulates the Earth’s magnetic field 
surrounded by its atmosphere. With these ingredients, Birkeland showed that 
the geomagnetic field was responsible for the formation of the aurora ovals. 
From the correlation between aurorae and the number of active regions on 
the Sun, Birkeland suggested that sunspots might be the source of a con- 
tinuous stream of particles [50]. This idea was an early introduction of the 
concept of a plasma flow from the Sun, which later evolved into the concept 
of the solar wind. 

In the 1930s, Sydney Chapman and Victor Ferraro developed an idea 
about solar-terrestrial relationships which comes closer to our current under- 
standing: sunspots are indicators of solar activity [95]. Solar activity manifests 
itself in violent eruptions, called solar flares. Chapman and Ferraro suggested 
that flares not only emit electromagnetic radiation but also fling out clouds 
of ionized matter which in size dwarf the Earth. After a travel time of 1 
to 3 days, such a cloud might hit the magnetosphere and compress it, lead- 
ing to geomagnetic disturbances. Some of the cloud matter might penetrate 
the magnetosphere close to the poles, causing the aurora. The existence of 
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these proposed clouds, today called coronal mass ejections (CMEs), was first 
confirmed in the early 1970s by Skylab. 

While their basic idea is accepted even today, Chapman and Ferraro’s 
ansatz still assumes that the geomagnetic field is a dipole. Only the first in 
situ measurements in the magnetosphere and in interplanetary space revealed 
the complexity of the magnetic field surrounding the Earth. But again, in- 
direct evidence had been found long before. In the 1920s, the existence of 
a region of charged particles in the atmosphere, the ionosphere, had been 
discovered because of its effect on radio waves: they propagate far beyond 
the horizon due to reflection off the conducting ionosphere. Motions and 
variations in charge density also can be used to explain the atmospheric con- 
tribution to the Earth’s magnetic field which had been proposed 70 years 
earlier by Gauss. Thus, early researchers in magnetospheric physics knew of 
a conductive layer at a height of some tens of kilometers. In the early 1950s, 
in Arctica and Antarctica, van Allen and colleagues launched rockets to a 
height of about 110 km. Their instruments confirmed the existence of en- 
ergetic electrons in this region, either directly or by observing the electron 
bremsstrahlung: the existence of the ionosphere had been confirmed by in 
situ measurements. In 1958, a Geiger counter on board the first US satellite. 
Explorer 1, detected the Earth’s radiation belts, later named van Allen belts 
to honor their discoverer. In the same year, the Soviet lunar probe made the 
first measurements in interplanetary space, confirming the existence of the 
long-proposed solar wind. The first detailed studies of the solar wind were 
made by Mariner 2 in 1962. The boundary between interplanetary space and 
the Earth system, the magnetopause, was first studied by Explorer 10 in 
1961; the bow shock in front of it was detected by Explorer 12 in 1962 and 
studied in detail by OGO (Orbiting Geophysical Observatory) in 1964. These 
and subsequent observations led to the identification of the main components 
of the magnetosphere, as discussed above. 



Exercises and Problems 

1 . 1 . Define a plasma. Discuss the importance of the density. Is there a limit 
for the relative or the absolute size of the neutral component? 

1 . 2 . What parameters are used to characterize a plasma? Briefly discuss their 
physical meaning. 

1 . 3 . What energies/ temperatures can be used to classify a plasma? 

1 . 4 . What do you need to explain a magnetosphere? What determines its 
spatial extent? 



1 . 5 . Describe the basic features of a magnetosphere. How do these properties 
change if the axis of the magnetic field changes with respect to the solar wind 
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direction; if the terrestrial magnetic field decreases; if solar wind pressure and 
speed increase? 

1 . 6 . Why is space dominated by plasmas? 

1 . 7 . Where in the near-Earth environment do plasmas exist? Would we miss 
them if they were neutral matter instead? 




2 Charged Particles in Electromagnetic Fields 



I’ve gotten a rock, I’ve gotten a reel, 
I’ve gotten a wee bit spinning-wheel; 
An’ by the whirling rim I’ve found 
how the weary, weary warl goes round. 

S. Blamire, I’ve gotten a rock 



In space physics the motion of charged particles in electric and magnetic 
fields often is described by a test particle approach: the particles are guided 
by the field but their motion does not affect the field. This approach is valid 
if the energy density of the magnetic field exceeds that of the particles. In 
the test particle approach the motion can be separated into two parts: the 
motion of a guiding center of the particle orbit and a gyration around it. 
The guiding center motion can be interpreted as the effective motion of the 
particle, averaged over many gyrations. This concept is applied to drifts in 
stationary electromagnetic fields. The adiabatic invariants allow simple esti- 
mates of the particle motion in slowly varying fields; they are applied to the 
motion of particles in the Earth’s radiation belts. This chapter starts with a 
brief recapitulation of the basics of electromagnetic field theory. 



2.1 Electromagnetic Fields 

Particle densities in interplanetary space and in the magnetosphere are low. 
Thus a description of the electromagnetic field in a vacuum is sufficient. Then 
the permeability // and the permittivity e both equal 1: the medium can be 
neither magnetized nor polarized. In principle, we can assign a permittivity 
to a plasma, but this is just another description of the existence of charged 
particles. A plasma can also be magnetized: for instance, an axial-symmetric 
ring current of charged particles in a dipole field leads to a reduction of the 
magnetic moment, which is a diamagnetic eflFect. An equivalent description 
is the magnetic induction JB in a vacuum, consisting of both the dipole field 
and the field disturbance, and the current arising from the particle motion. 
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2.1.1 Maxwell’s Equations in Vacuum 

In 1873 Maxwell (1831-1879) introduced a unified theory of the electromag- 
netic field giving for the first time the set of four partial differential equations 
that today bear his name [338] . The sources of the electric and magnetic fields 
are charges, magnetized bodies, and currents, which can be either discrete or 
continuous and either stationary or time-dependent. 

The electric field E generated by a charge density Qc is described by 
Poisson’s equation:^ 

V • E = Qc/sq • (2-1) 

Since the electric field is non-rotational, it can be expressed by the gradient of 
the scalar Coulomb potential (f:E = — V(/?. Integrating (2.1) over a volume 

V and using Gauss’s theorem (A. 33), Poisson’s equation can be rewritten as 
Gauss’s law for the electric field: 

^ E-dS^ y dV . (2.2) 

0(V) V 

Gauss’s law states: the electric fiux through a surface S enclosing a volume 

V is determined by the total charge inside V, If there are no net charges 
enclosed in V, the flux through S is zero. But V is not necessarily field- free, 
e.g. if V is placed inside a dipole field either enclosing none of the charges or 
both of them. Using a spherical test volume V with radius r. Coulomb’s law 
can be derived from Gauss’s law. 

Gauss’s law for a magnetic field is formally analogous, V • B = 0, or 



6 B-dS = 0. (2.3) 

0(V) 

It states that there are no magnetic monopoles. 

Faraday’s law describes the electric field (or electro-motoric force EMF) 
generated by a changing magnetic field: 

BB 

WxE = -—. (2.4) 

The magnetic flux $ through a surface S in the magnetic field is defined as 

^ Equations are given in SI units throughout. However, since the cgs system still is 
widely used in geophysics and space physics, equations which are frequently used 
to determine parameters also are given in cgs units in App. A. 2. The cgs system is 
advantageous in so far as the absolute permittivity and the absolute permeability 
both equal 1 compared with £o = (47 t • 9 • 10^)“^ F/m = 8.854 • 10“^^ F/m and 
/io = 47t • 10~^ H/m = 1.256 • 10“® H/m in the SI system. In the cgs system, 
on the other hand, occasionally a factor c = 1 /a/^oMo appears. If quantities in 
SI units are inserted into equations given in the cgs system, or vice versa, the 
analysis of units automatically leads to the correct consideration of £o and /xo- 
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^ = j B dS . (2.5) 

Graphically, the magnetic flux can be interpreted as the number of held lines 
going through S. Using this definition and Stokes’s theorem (A. 39), Faraday’s 
law can be rewritten as 

c o 

It states that a change in the magnetic flux through a surface creates an 
EMF in its circumference. The minus sign indicates that a current generated 
by the EMF causes a magnetic field anti-parallel to the original one (Lenz’s 
rule). Faraday’s law has two consequences: a stationary magnetic field does 
not produce an electric field. And if the electric field is zero, the magnetic 
field is stationary. 

Ampere’s law describes magnetic field generated by a time-dependent 
electric field E and a current density j: 

dE 

V X ^ = jiQj -h . (2.7) 

The last term on the right-hand side is the displacement current. It is related 
to the equation of continuity: by taking the divergence of (2.7) we get 

V • (V X B) = /xqV • j + /to£oV • ^ = MoV • {Qcv) + • (2.8) 

Since the divergence of a rotational field (left-hand side) vanishes (A. 25), this 
gives the equation of continuity for charges (see Sect. 3.1.3): 

^ + V • {qcv) = 0 or ^ + ^cV • r = 0 . (2.9) 

Using Stokes’ theorem, (2.7) can be written as 

j B-dl = fioJ j-dS + fioeoj^-dS . (2.10) 

c 

Ampere’s law states that a changing electric field and/or a current creates a 
rotational magnetic field. 

Maxwell’s equations for the electric and magnetic field are symmetric, 
except for the fact that there are neither magnetic charges (there are no 
magnetic monopoles) nor magnetic currents. 

2.1.2 Transformation of Field Equations 

In space physics, fields and plasmas move with respect to the observer: the 
solar wind is swept across a spacecraft in interplanetary space, another space- 
craft crosses through the radiation belts of a planet. The fields E^B and 
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E',B' in two reference frames C and C", with C' moving with velocity v 
with respect to C, are related by the relativistic transformations 

1 r 17 

E' = - E -V — {v ' E) {1 — V X B and 
7 L 

B' = - B + ^{vB){1-^)-\vxe] , (2.11) 

7 L 

with 7 = ^yl — jc?. In the non-relativistic case, all terms in j(? can be 
ignored (7 -> 1), reducing the equations to 

£?' = £J + i;xJ5 and B'^B-\vy.E. (2.12) 

Equation (2.12) implies that field components parallel to the direction of 
motion remain unchanged. 

Within the framework of this book, the second set of transformations will 
be sufficient, considering effects of order vjc (e.g. the Doppler effect) but 
ignoring effects of order j(? (e.g. the Lorentz contraction). 

We shall frequently encounter one consequence of the field transformations 
for space plasmas, namely the v x B electric induction field (the second 
term on the right-hand side of the first equation in (2.12)): the convection 
of a magnetic field B with a plasma moving at speed v leads to an electric 
induction field v x B. Applications include the electric field in the front of 
non-parallel shocks (Sect. 7.5.1) and the electric field in interplanetary space 
leading to the corotation of energetic particles (Sect. 6.3). 

2.1.3 Generalized Ohm’s Law 

Ohm’s law connects the current density j and the electric field E hy a, con- 
stant, the conductivity a: 

j^aE . (2.13) 

Note that often, e.g. in the ionosphere, the conductivity is anisotropic and 
should be described by a tensor rather than a scalar (Sect. 8.3.2). 

Equation (2.13) is valid in the plasma rest frame only. If an observer is 
moving with velocity v with respect to the plasma frame, a generalized form 
of Ohm’s law is required. It can be obtained by applying (2.12). The plasma 
has a high conductivity, that is all electric fields except induction fields vanish 
immediately, and thus the current on the left-hand side of (2.13) transforms 
under consideration of (2.7) as j = j'. The generalized form of Ohm’s law 
then reads 

j = a{E-{-v X B) . (2.14) 

The second term on the right-hand side describes the electric induction field 
which gives rise to the Hall current. 
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2.1.4 Energy Equation of the Electromagnetic Field 

Prom Faraday’s law we can derive an energy equation for the electromagnetic 
field. Multiplication of Faraday’s law (2.4) by B and integration over a volume 
V gives 

j B -^d^r = - J B -{V X E)d^r . (2.15) 

V V 

The divergence of a vector product can be written as (see (A. 29)) 

V • (E X J5) = S . (V X E) - E . (V X B) . (2.16) 

The second term on the right-hand side can be rewritten using Ampere’s law 
(2.7). Solving for the first term on the right-hand side we get 

1 oin 

B ■ X E) = V ■ {E X B) + (loE ■ j + ~ . (2.17) 

Inserting into (2.15) gives the the energy equation 

j B-^d^r = ~ j V-{ExB)d^r- J fioE-jd^r 



With Gauss’s theorem (A. 33), the volume integral in the first term on the 
right-hand side can be changed into a surface integral. The first term on the 
left-hand side and the last terms on the right-hand side contain the product 
of a vector and its temporal derivative. This is equal to half the temporal 
derivative of the squared vector as can be seen by differentiating the middle 
term in 

da _ 1 d{a • a) _ 1 da^ 

^ dt 2 dt 2 dt 
The energy equation then reads 



(2.19) 






0{V) 



{E X B) 
Mo 



dS 



-i 



E- jdV . (2.20) 



Here B^/2/io and SoE‘^ j2 are the energy densities of the magnetic and the 
electric fields. Equation (2.20) can be interpreted as an equation for the 
continuity of the electromagnetic field: the change in the energy density of the 
electromagnetic field inside a volume V is given by the energy fiux (Poynting 
vector 5p), 

Mo 



( 2 . 21 ) 
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through the surface of the volume and ohmic losses E • j inside V ; positive 
values of ^ • j indicate losses, while negative ones describe a generator. 

If we consider electromagnetic fields in matter, the field exerts a force on 
the particles, described by the j x B term in the equation of motion (3.28). 
Thus, the energy equation has to be supplemented by a term describing the 
work done by the field: 



1 f + 

dt J \2fxo 2 



0{V) V 

— J u- {j X B) d^r . 

V 



E • j d^r 



( 2 . 22 ) 



A comparison of the energy densities of the field and the plasma shows 
whether the particle motion will be governed by the electromagnetic fields or 
by the gas laws: if the plasma’s energy density is high and the conductivity is 
infinite, the field is frozen-in (Sect. 3.4.1) and carried away by the plasma (e.g. 
the interplanetary magnetic field frozen-into the solar wind). If the energy 
density of the field is high, the field determines the motion of the particles 
(e.g. energetic particles in interplanetary space or in the radiation belts). This 
latter case is discussed in this chapter. 



2.2 Particle Motion in Electromagnetic Fields 

Let us now turn to the motion of individual charged particles in a prescribed 
electromagnetic field. Particle densities are assumed to be small: there are no 
collisions between particles, the particle motion is determined by the fields 
only. The energy density of the particles is small too; thus their motion does 
not modify the external field. 

Although these limitations are strong, the resulting motion and their for- 
mal description are basic and instructive. Table 2.1 places them into the 
general framework of particle motion in electromagnetic fields. Fields can ei- 
ther be smooth or turbulent. For smooth fields, two cases of particle motion 
can be distinguished: (a) The field varies only weakly on the temporal and 
spatial scales of the gyration and the particle motion can be described by the 
concepts of guiding center motion and adiabatic invariants (Sect. 2.4). These 
concepts can be applied to particles in the radiation belts (Sect. 8.7.1). (b) 
The field changes significantly during one gyration of the particle. The above 
concepts are no longer valid and the equation of motion has to be integrated. 
One example are the Stprmer orbits of galactic cosmic rays in the magneto- 
sphere (Sect. 8.7.2). Turbulent fields require an entirely different approach. 
The particle motion is determined not only by the average magnetic field 
but also by scattering at field fiuctuations, a stochastic process. Formally, we 
have to consider particle ensembles instead of single particles and transport 
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Table 2.1. Particle motion in different types of magnetic fields. The characteristic 
length scale L for changes in the magnetic field is defined as 1/L = {1/\B\) dB /dx 



Field 




analytical description possible 




turbulent, irregular 
fast fluctuations 



Scales small variations in 


large variations 


weak turbulence 


space and time 




6<|<B>| 


tl « L 


B =< B> +b, 


strong turbulence: 




^ L > L 


1 < B > 1 


Formalism adiabatic invariants; 


integration of the 


transport equations 


guiding center 


equation of motion pitch-angle scattering 


motion; drifts 




resonance interaction 


Occurrence periodic motion in 


Stprmer orbits 


particle propagation 


radiation belts. 




in interplanetary 


magnetic mirrors 




space 


and bottles 







equations instead of equations of motion. Propagation then can be under- 
stood as a diffusive process (Chap. 7.3); applications are the interplanetary 
transport (Sect. 7.4) or particle scattering into the loss cone in the radiation 
belts (Sect. 8.7.1). 



2.2.1 Lorentz Force and Gyration 



The general equation of motion is Newton’s second law F = dp/dt. The 
net force on a particle can consist of different components, e.g. gravitation, 
electromagnetic forces, and a pressure gradient. In a pure electromagnetic 
field only the Lorentz force acts on a particle 



^ du _ 

Fl = m— =q{E^-v X B) 
at 



(2.23) 



First Integral of Motion. In a pure magnetic field the electric field is zero. 
According to Faraday’s law (2.4), the field then is stationary. The equation 
of motion reduces to 



du _ 

m— = qv X B . 
dt ^ 



(2.24) 
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Multiplication by the particle speed v and consideration of (2.19) gives the 
first integral of motion: 



1 dv^ 
-m—— 

2 dt 



dW^kin 



= qv ' {v X B) 



because the cross-product v x B is perpendicular to v and thus its scalar 
product with v vanishes. The first integral of motion (2.25) states that in a 
pure magnetic field the kinetic energy Wkin of a particle is constant. Wkin is 
given in electronvolts (eV) where 1 eV = 1.602 -10“^^ J is the energy gained 
by an electron after traversing a potential difference of 1 V. 



Gyration. Let us now assume a homogeneous magnetic field along the z 
axis: B = Bcz- The equation of motion for the components then reads 

'^'^x = QBvy , mvy = —qBvx , and mVz = 0 . (2.26) 

Integration of the last equation gives t;|| = Vz = const: the particle moves 
parallel to the field line with constant speed Vz> The other two equations 
are coupled ordinary differential equations (ODEs) of first order. They can 
be combined into two separate ODEs of second order by first differentiating 
them with respect to t and then inserting the other ODE: 



.. qB . (qBV , qB . 

Vx = — '^y = - \ — and Vy = Vx 

m \ TU J m 



m 




(2.27) 
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The direction of motion depends on the particle’s charge and has to obey 
Lenz’s rule: the ring current associated with the particle motion creates a 
magnetic field opposite to the external one. An electron obeys the right-hand 
rule (see Table 2.2): if the thumb is directed along the magnetic field line, 
the tips of the curved fingers give the direction of the electron motion. If the 
initial velocity has a component parallel to the magnetic field, the particle 
follows a helical path around the line of force. 

Example 1. A hot plasma with T = 1 keV is confined in a ho mogeneous 
magnetic field B = 1 T. The thermal speeds are given hy v = 

Thus we obtain = 18.7 x 10^ km/s for the electrons and Vp = 4.37 x 10^ m/s 
for the protons. The Gyro-radii then are determined from (2.32) to be Vq = 
0.1 mm and Vp = 4.6 mm. From (2.28), we get obtain (jUc,e = 1.8 x 10^^ s~^ 
and Uc^p = 10^ s"^ for the cyclotron frequencies. Note that both Uc and 
fL scale with B; thus in a much weaker field, such as the interplanetary 
medium, which is of the order of a few nT, the gyro radii would increase by 
many orders of magnitude while the cyclotron frequencies would decrease by 
orders of magnitude. □ 



2.2.2 Useful Definitions 

The magnetic rigidity P [V] describes the resistance of a particle to change 
its direction of motion under the infiuence of a magnetic field. It is defined 
as the ratio of the momentum p± perpendicular to B and the charge q: 

P = y . (2.33) 

The Larmor radius (2.32) then is the ratio between the magnetic rigidity and 
the magnetic field strength ri, = PjE. 

The gyration of the particle is determined by its speed v± perpendicular 
to the magnetic field while the particle itself is characterized by its energy or 
velocity. The relative sizes of the velocity components parallel and perpen- 
dicular to the magnetic field can be described by the pitch angle a with 

tano = — . (2.34) 

^11 

The velocity components perpendicular and parallel to the field therefore are 
v± = V sino; and = v cos a. 

A particle with charge q moving in a circular orbit with radius tl and 
gyration time Tc gives rise to a ring current / = q/T^ = qUc/{2'K). The 
magnetic moment ji is defined as the product of the ring current and the 
enclosed area A — TTr^: 



H = \I\A = 



mv\ 






2B 



B 



(2.35) 
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The orientation of the magnetic moment is determined by the direction of 
the ring current: 



1 mv]_ B B 

H = -qr^ xv± = ^ 



B 






(2.36) 



where Ib = B/B is the tangential unit vector to the magnetic field. The 
magnetic moment does not depend on the charge of the particle; its direction 
is opposite to that of the external field. A plasma, therefore, is diamagnetic. 

In a highly conductive plasma, no electric field exists and the particle en- 
ergy is conserved (see (2.25)). Then is constant. Therefore v± is constant, 
too. If the magnetic field is constant, the magnetic moment will also be con- 
stant. Even in the case of a slowly (adiabatically) varying field /x is constant. 
It is therefore called an adiabatic invariant (Sect. 2.4.1). 



Excursion 1. Relativistic Quantities. The highly energetic particles acceler- 
ated in solar flares or coming from the galactic cosmic radiation have speeds 
close to the speed of light. Thus these particles have to be described in terms 
of relativistic quantities. The mass of a particle with rest mass mo increases 
with increasing speed v, as described by the relativistic mass equation 



m{v) = = 7mQ • (2.37) 

The relativistic energy is then 

E = me? , (2.38) 

the relativistic momentum is 

W^- 

p = m{v) V = 'ymov , and ^ — moc^ . (2.39) 

Consequently, the relativistic force can be written as 



^ d , 

■F = ^ (irrnov) . 

The cyclotron frequency of a relativistic particle then is simply 

_\q\B \q\B 

^c,rel — — ) 

rnrel 7^0 

and the Larmor radius is 

_ v± _ P± 

^L,rel — “ I I D * 

^c,rel 1^1 B 



(2.40) 



(2.41) 



(2.42) 

□ 
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Example 2. So far, the highest proton energies observed in the galactic cosmic 
radiation are 10^^ eV. These protons gyrate in an interstellar magnetic field of 
about B = 3x 10“^^ T. Since for such high energies the kinetic energy Wkin is 
approximately equal to the total energy E'totab second term on the right- 
hand side of the second equation in (2.39) can be ignored and the particle 
momentum is p = M^kin/c. With (2.42), we obtain a maximum Larmor radius 
of 

eB 



tl 



ceB 



= 10 ' 



,21 



m 



(2.43) 



This is approximately the size of the Milky Way. 



□ 



Excursion 2. Local Gyration Radius. The example above points to a general 
problem, already mentioned in connection with Table 2.2: for a particle with 
pitch angle a = 90° , a closed gyro-orbit with a constant tl results only if the 
magnetic field is homogeneous across the particle orbit. This is certainly not 
true for the Milky Way, and it is also not true for the path of cosmic rays in 
the magnetosphere. In these cases, a different approach is required. Again, 
the particle motion is decomposed into motions parallel and perpendicular 
to the field, and the equation of motion is separated into two parts, 

^1=0 and = i. X B) . (2.44) 

dt dt ^ ^ ^ 

We assume a stationary magnetic field, dB/dt = 0; the direction parallel 
to the field lines is given by the tangential vector ts = B/B. Instead of a 
gyration radius for the entire orbit, we now can determine a “local gyration 
radius” which gives the local curvature of the particle path; see Fig. 2.1. 

The equation of motion gives us a closed gyro-orbit for B = const over the 
entire orbit. For B = B{r^t) it gives us the local Lorentz- force from which 
we obtain the local gyro-radius. Locally, its magnitude is the same as for a 
constant field with the local value of B. We can derive this also graphically: 
During a time interval 6t the radius of curvature Tc changes in accordance 




Fig. 2.1. Particle orbit in a stationary but non- 
homogeneous magnetic field 
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with Svc = v± 5t, while the perpendicular speed changes in accordance with 
Sv± = (q/m) ('U_L X B) St. Since the triangles in Fig. 2.1 are similar, we 
obtain Svc/vc = 6v±/v±. Solving for gives the curvature radius or local 
Larmor radius 



mu_L 



(2.45) 



or, in vector form, where = v/v \s the tangent vector to the velocity. 



r*iL 



mv±_ 

W 



(tv ^ ^b) 



(2.46) 



The equation gives essentially the same result as (2.32). The only difference 
is that now riL depends on position: riL = r\\^{r) because B = B{r). Thus 
riL and riL both vary along the particle orbit, while in (2.32) tl is constant 
along the particle path. □ 



2.3 Drifts of Particles in Electromagnetic Fields 

Particle drifts in electromagnetic fields result from changes in the Larmor 
radius during one gyration, either due to changes in particle speed (as in 
E X B and in the gravitational drift) or in the magnetic field (as in the gra- 
dient drift). For particles with pitch angle 0° the drifts vanish, except for the 
curvature drift, because the latter arises from the field-parallel motion. 



2.3.1 The Concept of the Guiding Center 

The concept of the guiding center, introduced in the 1940s by H. Alfven, 
separates the motion v of a particle into motions vy parallel and v± perpen- 
dicular to the field. The latter can consist of a drift ud and a gyration u?: 

V = V\\ V± = V\\ + vd (aJ = Ugc -h uJc (2.47) 

with Vgc being the motion of the guiding center. Thus the motions Vgc of the 
guiding center and the gyration c^c around the magnetic field are decoupled. 
If we follow the particle for a long time, the gyration is of minor importance. 
In some sense it is averaged out, and the particle motion is described by the 
motion Ugc of the guiding center, consisting of a field-parallel motion and a 
drift. The particle always is within a gyro-radius of this position. 

2.3.2 Crossed Magnetic and Electric Fields: E X B Drift 

Assume an electric field E perpendicular to the magnetic field S, both ho- 
mogeneous and constant in time. The magnetic field forces the particle into 
a gyration around the field line. During half of its orbit, the particle motion 
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Table 2.2. Drift in various types of fields 



B upward Charge eq. 

through the paper positive negative 



B homogeneous 




E homogeneous 




Gravitation 


JUUJUl 

^vp 'gb 


Inhomogeneous B 
jgrad \B\ 


t JLQjJJl""’ 

^VP 



has a component parallel to the electric field, during the other half it is anti- 
parallel. Therefore the particle alternately is accelerated and decelerated. 

Let us now look at this motion in detail. The second row of Table 2.2 
gives the motion of particles in a magnetic field pointing upwards through the 
paper and an electric field pointing downwards. An electron (right column) 
starts in the upper right corner and is forced into a downward motion by the 
magnetic field. Since this motion is parallel to the electric field, the electron 
decelerates and its gyro-radius decreases. At the lowest point of its orbit 
the electron therefore is shifted towards the left with respect to its starting 
point. In the upward part of its gyro-orbit the electron moves anti-parallel 
to the field. Thus it is accelerated and its Larmor radius increases, shifting 
the electron farther to the left. This E x B drift results from a continuous 
change of the particle’s gyro-radius and is perpendicular to both the electric 
and magnetic fields. Therefore, in spite of the presence of an electric field 
the particle does not gain energy. For a particle with positive charge, the 
direction of the drift is the same: its gyro-motion is opposite to that of an 
electron, as are the parts of its orbit with the smallest and largest gyro-radii. 
Since electrons and protons drift into the same direction, no current results. 

To derive a quantitative description of the E x JB-drift we substitute 

E X B 

^2 



V = w -\- 



(2.48) 
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This corresponds to the transformation into a frame of reference moving with 
velocity E x It is i) = w because both the electric and magnetic fields 

are constant. The equation of motion therefore reads 

mw = mi) = qE + qw x B + x B) x B . (2.49) 

Because E is perpendicular to jB, the double cross-product in the last term 
can be written as —EB^ (A. 20). Therefore, the first and last terms on the 
right-hand side cancel and the equation of motion reduces to 

mw = qw X B . (2.50) 

Now w fulfills (2.24). In the new reference system the particle motion there- 
fore is a gyration and the motion of this system gives the drift velocity: 

E X B , . 

VexB = • (2*51) 

The direction and size of the drift depend on the fields alone: particle prop- 
erties such as mass, charge or velocity do not enter into the drift. 

We can also derive a general equation for the drift velocity in the presence 
of a general force F perpendicular to the magnetic field by substituting the 
electric field E by the general force F/q in (2.48) and (2.49): 



F X B 



1 /F B 

— — X 

(jJc \m B 



X Ib . 

ujcm 



(2.52) 



In the above example F = qE has been used. 



Example 3. Wien filter. A magnetic field B = 5 x 10 T is perpendicular to 
an electric field E = 1000 V/m. This configuration provides a simple example 
of E X B drift. Since the two fields are perpendicular, we obtain from (2.51) 
a drift velocity vexb = {EB)/B‘^ = EjB = 2 x 10^ m/s. The drift velocity 
is perpendicular to both the electric and the magnetic field. An electron 
approaches perpendicular to both fields. On hitting the field combination, 
the electron will start to gyrate around the magnetic field and drift along 
its original direction of motion. The size of the gyro-orbit depends on the 
electron speed Ue. For Vq = vexB the electron moves along a straight line. 
This can be understood as follows: if the electron moves along a straight line 
at constant speed, no force acts on the electron. Thus the forces exerted by 
the electric and the magnetic field must be equal: eE = ev^B or Ve = E/B^ 
which is the drift velocity derived above. □ 



2.3.3 Magnetic and Gravitational Fields 

Now consider a gravitational field perpendicular to a homogeneous magnetic 
field (third row in Table 2.2). As in the previous case, the Larmor radius 
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changes continuously. But here the external force does not depend on the 
charge sign, and the points of the largest and smallest gyro-radii are the 
same for both positive and negative particles. Because particles with different 
charge signs have opposite directions of gyration, they drift into opposite 
directions. The direction of drift therefore depends on the charge q, and its 
size depends on the particle mass m, because the external force F = mg 
depends on the particle mass. Equation (2.52) yields for tjie drift velocity 



V 



9 ~ 



mg X B 

7 



(2.53) 



A gravitational field perpendicular to a magnetic field therefore allows the 
separation of particles with positive and negative charges. This drift leads to 
a current. In addition, the magnetic field prevents the particles from “falling 
down”. There is no net acceleration along g and the potential and kinetic 
energies averaged over a gyration both are constant. 



Example A proton with a kinetic energy of 1 keV (and also a 10 keV 
electron) gyrates in the equatorial plane of the terrestrial magnetic field at a 
radial distance of five Earth radii from the center of the Earth. All its kinetic 
energy is in the gyration. The equatorial magnetic field at the surface of the 
Earth is 3.11 x 10“^ T; it falls off with radial distance as . Thus the 
local field at the proton orbit is B = 2.5 x 10“^ T. The proton speed is Vp = 
v/mw m = 4.4 X 10^ m/s (and the electron speed is Ve = 5.9 x 10^ m/s); its 
gyro-radius according to (2.32) is rL,p = 1.8 x 10^ m (rL,e = 1.4 x 10^ m/s), 
which is still small compared with the scales of the system; for instance, 
the drift path around the Earth has a length of /drift = 27rr = 507rrE = 
2 X 10^ m. The gravitational acceleration scales with r^, and thus at the 
particles position it is only g/2b^ g being the gravitational acceleration at the 
surface of Earth. Since at and above the equator the magnetic field is parallel 
to the surface, the gravitational field is perpendicular to the magnetic field. 
The drift speed from (2.53) is then Vg = 1.6 cm/s perpendicular to both 
fields: the particle drifts along a circle in the equatorial plane. Since the 
geomagnetic field has its south pole close to the geographic north, the field 
in the equatorial plane is directed northward. Thus a proton drifts to the 
west while an electron drifts eastward. The g x B drift thus would give the 
particle such a small speed that it would take 1.25 x 10^^ s or about 400 years 
to drift around the entire Earth. Note that the drift speed depends only on 
the particle mass. Thus all protons drift with the same speed, independent 
of their energy. To be more precise, almost all protons: if the energy becomes 
too large, the gyration radius of the particle may become so large that it 
either hits the atmosphere and is absorbed during an interaction or suddenly 
finds itself in interplanetary space and escapes. The same argument holds 
also for all other particle species; only their drift speeds must be scaled by 
the ratio of their mass to the proton mass. The drift speed of an electron is 
therefore smaller by a factor of 1836 than that of the proton. □ 
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2.3.4 Inhomogeneous Magnetic Fields 

In an inhomogeneous magnetic field (bottom row in Table 2.2) particles with 
positive and negative charges also drift in opposite directions. In contrast 
to the previous example, here the particle speed stays constant during the 
gyration but the gyro-radius increases/decreases with decreasing/increasing 
field strength. This change in gyro-radius is independent of the charge sign, 
leading to drifts of electrons and protons in opposite directions. The resulting 
gradient drift is 

vvb = ^BxVB = ±^BxVB-, (2.54) 

the ib sign refiects the charge dependence of the drift direction. In contrast 
to the gravitational drift discussed above, here the drift speed depends on 
the particles kinetic energy and thus on both speed and mass. 

A very efficient drift develops in the configuration of two opposing mag- 
netic fields. Imagine the guiding center of the particle orbit on the neutral 
line between the fields. The particle starts its gyration in the upper half of the 
field, but after crossing the neutral line, the direction of gyration is reversed. 
Therefore, instead of a closed circular orbit two semicircles result, leading to 
displacement by 4 Larmor radii during one gyration (see Fig. 2.2). In inter- 
planetary space such a drift takes place along the heliospheric current sheet 
and contributes to the modulation of galactic cosmic rays (Sect. 7.7). 

In the magnetosphere this drift leads to the ring current and the motion 
of particles trapped in the radiation belts (Sect. 8.7.1). 

Example 5. Let us briefly return to example 4, where we saw that the g x B 
drift is extremely slow. Since the magnetic field decreases as r“^, during 
its gyration the particle scans regions of different magnetic field strength, 
and thus a gradient drift results. The gradient of the magnetic field has the 
same direction as the gravitational acceleration, and thus the gradient drift 
is in the same direction as the g x B drift. Although the magnetic field is 
roughly a dipole field, in this special case it can be treated as spherically 
symmetric since the particle gyrates in the equatorial plane and therefore 
is not influenced by the latitudinal variation of B. From (A.43) we obtain 
the gradient of the magnetic field B{r) = Bq (ro/r)^ as VS = (-B/r, 0, 0). 
The cross product in (2.54) then gives \B x VB| = B^/r and the drift speed 

^ Vb(e“) -^vd(p'^) 

OOOOOOOOOO B± 

Fig. 2.2. An efficient gra- 
dient drift occurs in adja- 
cent fields of opposite polar- 
ity, for instance along the he- 
X X X X X X X X X X B 2 = —B^ liospheric current sheet 
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becomes = ri,v±/{2r) = 220 km/s for the proton. For the 10 keV electron 
we get a drift speed of 259 km/s. Thus drift frequencies are of the order of 
mHz (see also Fig. 8.47). □ 



2.3.5 Curvature Drift 



Imagine a homogeneous magnetic field with the lines of force are curved with 
a radius Tc. In a vacuum such a field would obey Maxwell’s equations only 
if combined with a magnetic field gradient. The net drift therefore would be 
the sum of the curvature drift and the gradient drift. To derive the curvature 
drift alone, let us start from a simplified configuration without a magnetic 
field gradient. The drift arises from the centrifugal force Fcf and thus is 
determined by and not by v± as in the previous examples. Inserting the 
centrifugal force = mt’|er/rc = mv‘^^rdr^ into (2.52) yields 



mv?, TcxB 



(2.55) 



The curvature drift depends on the charge, mass, and speed of the particle. 

In a real field, a field gradient exists in addition to the curvature. Ampere’s 
law (2.7) in a vacuum without electric current gives V x B = 0. In cylindrical 
coordinates (see Sect. A. 3. 2 and Fig. 2.3), B has only one component in 0, 
VH only one in r, and V x J5 only one in z, given as 



(V X B), = 



1 d{rBe) 
r dr 



= 0 . 



(2.56) 



This is zero because Be is proportional to 1/r. Then |-B| is proportional to 
1/vc and it is V|B|/|B| = The gradient drift (2.54) then reads 



m 



2 rc X J5 






(2.57) 



Combined with (2.55), the drift in a curved magnetic field is 



Vr + v^B = 



mvc X B 
q r^B‘^ 



^11 + 



Vn + 









Ur 



X ^ . (2.58) 
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Fig. 2.3. Curved magnetic field and coordinate system 
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This equation has an important implication: if particles are trapped inside a 
magnetic field torus, the best and finest adjustments in temperature and 
field cannot prevent the particles from drifting across the field lines and 
out of the torus sooner or later. This is one of the fundamental problems 
in fusion research. Instead of a simple torus, extremely complex field con- 
figurations are required, as for instance in the stellerator Wendelstein 7-X 
(see e.g. www . ipp-garching . mpg . de/eng/pr/publikat ionen/broschuere . 
engl.pdf or www.ipp-garching.mpg.de/eng/pr/exptypen/stellarator/ 
pr_exp_st e . html) . 

Example 6. A prot on plasma with a temperature of 10 MeV (corresponding 
to a speed of ^’ = y/2W^i^Jm = 4.3 x 10^ m/s) is confined by a homogeneous 
magnetic field inside a torus of diameter 1 m. The dimensions of the torus also 
give the diameter of the gyro-orbit. Thus a magnetic field B = mv±/{qri,) = 
0.9 T is required to keep the proton inside the torus. According to (2.55), 
the curvature drift is then = mv‘^ri,/{qB) = 0.24 m/s tangential to the 
particle path. □ 

2.3.6 Drifts Combined with Changes in Particle Energy 

The drifts discussed so far have been associated with acceleration in the 
sense of a change in the direction of motion but not in average speed. Cer- 
tain combinations of fields, however, can lead to changes in average speed 
and therefore also in particle energy. Figure 2.4 shows the drift of a particle 
in a crossed electric (pointing upward in the drawing plane) and magnetic 
(pointing upward out of the drawing plane) field with a gradient perpendic- 
ular to both the magnetic and electric fields. The motion under the infiuence 
of two combined fields has been discussed above and is indicated by thin 
lines: for a particle with positive charge the E x B drift leads to a horizontal 
motion towards the right while the grad^ drift leads to an upward motion. A 
combination of both motions gives the EgradB drift oblique to the fields and 
to the magnetic field gradient. Owing to this drift, the particle moves from 
one potential to another, gaining energy. Note that the use of a gravitational 
field instead of the electric field would yield similar results. 
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Fig. 2.4. One example of a 
configuration of fields in which 
the drift of a charged particle 
leads to changes in particle en- 
ergy 
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2.3.7 Drift Currents in Plasmas 

Some of the drifts discussed above lead to the separation of positive and 
negative charges, giving rise to a drift current. In a plasma, instead of single 
particles we have to consider k different particle species with number densities 
rik and masses rrik. With the general equation for the drift velocity (2.52) we 
can derive a drift current: 

JD = VDk ■ (2.59) 

k 

The drift current due to the gradient drift, for instance, is given as 

^'dvb = E ^ • ( 2 - 60 ) 

This current results from the inhomogeneity of the field and leads to a charge 
separation. An example of a natural drift current is the ring current in the 
magnetosphere (Sect. 8.3.3). 

Example 7. In example 5, we calculated the drift speeds of 1 keV protons 
and 10 keV electrons. In the radiation belts, both have a number density 
n = 10^ m“^. With the drift speeds from example 5, we can determine the 
ring current densities by using (2.60) and j — 7.7 x 10“^ A/m^. □ 



2.4 Adiabatic Invariants 

So far we have only considered static and homogeneous fields. In weakly and 
slowly varying fields, the concept of adiabatic invariants provides a powerful 
tool to describe the periodic motions of particles. A simple analogy is the 
mathematical pendulum: if its length increases only weakly during one swing, 
the ratio of the pendulum’s energy and frequency is a constant of the motion, 
called an adiabatic invariant. For charged particles in a magnetic field, three 
types of motion can be identified: 

1. The field changes only slowly during one gyration: 

1 dB LUc 
B dt ^ 2 tt 

2. The field varies only weakly on a scale comparable with the distance trav- 
eled along the field by the particle during one gyration (bounce motion, 
longitudinal oscillation): 



(2.61) 



VBii a;, 
^ < 



2'kv\\ 



B 



(2.62) 
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3. The field varies only weakly in the area encircled by the particle during 
the gyration or drift motion: 



V-gj- 

B 2ttv± B 2ttvb 



(2.63) 



If the particle motion is described by a pair of variables {pi,qi) which are 
generalized momentums and coordinates, for each periodic coordinate qi the 
action integral 

Ji = ^ Pi^Qi ^ (2-64) 

integrated over a complete cycle of motion, is approximately an invariant 
or constant of the motion, provided changes in the variables occur slowly 
compared with the relevant periods of the system and the rate of change is 
almost constant (for a proof of this statement, see, for example, [192,303,307, 
382,508]). Thus a system can change from one state of motion into another 
and still have the same action integral. 



2.4.1 First Adiabatic Invariant: The Magnetic Moment 



The first adiabatic invariant states that in a slowly varying magnetic field 
the magnetic moment /i = W±/B is almost constant. It finds applications in 
magnetic mirrors and bottles. We can derive it by inserting the generalized 
momentum p = mv± and the generalized coordinate q = (with 'll; being 
the azimuthal angle along the gyro-orbit) into the action integral (2.64) 



Ji = ^ Pi dqi == ^ 'mvj_ri, (i'll; = 27rmrLUj_ . 
With (2.32) and (2.35) this yields 



m 



Ji = An— r 11 = const 

m 



(2.65) 



( 2 . 66 ) 



under the tacit assumption that u/u)c <^1 with lo characterizing the change 
in B. Thus (2.66) states: as long as m/\q\ is constant, the magnetic moment 
is constant, too. 

A less formal but more illustrative proof, without using the action in- 
tegral, starts from the motion of a particle in a magnetic field that varies 
slowly in space or time. Let us choose the latter attempt, i.e. dB/dt 7 ^ 0. 
Then, according to Faraday’s law (2.4), a rotational electric field arises with 
a component parallel to the orbit of a gyrating particle (or completely par- 
allel to the motion of a gyrating particle if the latter has pitch angle zero). 
If we further assume to be zero, the particle speed is — t;_L = dl/dt 
with dl being a small element of the particle path. Multiplying the equation 
of motion (2.23) by the particle speed v±^ we derive the temporal change in 
kinetic energy: 
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d / mv‘^ 
dt 



= qE • V = qE • 



dl 

dt 



(2.67) 



Integration over a gyro-orbit gives the energy change during a gyration 



2tt/u) 

j QE^^dt. ( 2 . 68 ) 

0 

According to our assumptions, the field changes only weakly during one gy- 
ration. Therefore, the particle orbit still is nearly circular and the integration 
in time can be substituted by a line integral along the particle path: 

SWkin = ^qE-dl. (2.69) 

With Stokes’ theorem (A. 39) and Faraday’s law (2.4) we obtain 

<5T^kin =qj{VxE)-dS = -qJ^-dS, (2.70) 

s s 



with S being the surface enclosed by the Larmor orbit, its direction given by 
the right-hand rule. The integral is positive for negatively charged particles 
and negative for positively charged ones. It then can be written as 



XU/ 2 

(ilTkin = ±9-^ 7 rr£ . 



(2.71) 



With the Larmor radius expressed in terms of the cyclotron frequency this is 



(5Wkin = ±7TQ 



dBvl m 

dt LUc ztqB 



Wkin 2irdB/dt 

B (jOc 



(2.72) 



The first part of the right-hand side gives the magnetic moment, the second 
part the variation 6B of the magnetic field during one gyration: 5VFkin = /J^SB. 
Inserting the definition of the magnetic moment (2.35) into the left-hand side, 
we get S{iJ/B) = fiSB Bd/i = fiSB^ which implies 



Bdfji = 0 . (2.73) 

Because B 7 ^ 0, this equation gives the first adiabatic invariant: in a slowly 
varying magnetic field the magnetic moment is constant. 

Changes in the magnetic field result in variations in the Larmor radius. 
Thus the question arises: does the magnetic flux through a Larmor orbit 
change in a slowly varying field? With the definition of the magnetic flux 
(2.5) and the area S enclosed by the gyro-orbit we can write 
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Inserting (2.28) yields 






2'Km 



(2.75) 



Thus as long as /x is an invariant of the motion, the magnetic flux $ through 
a gyro-orbit is constant. 



2.4.2 Magnetic Mirrors and Bottles 

Magnetic mirrors and bottles are applications of the first adiabatic invariant. 
In its top left panel Fig. 2.5 shows the configuration of a magnetic mirror 
together with the path of a particle. The panel below shows the dependence 
of r’||, v±^ and a on the particle’s location. The panel on the right illustrates 
the restoring force at the mirror point. 

Let us start with a particle on the left-hand border of the graph with 
initial speeds ?;[| i and v±^i parallel and perpendicular to the magnetic held 
and initial pitch angle a\. The particle gyrates around the field line, and its 
guiding center moves to the right. Since the electric field is zero, the particle’s 
kinetic energy is constant: 

-f v\) = const . (2.76) 

The kinetic energy perpendicular to the magnetic field can be expressed by 
the magnetic moment fi (2.35): 

^mv‘^ = + fiB . (2.77) 





Fig. 2.5. Magnetic mirror: configuration {top left)^ variations in particle speed 
parallel and perpendicular to the magnetic field and pitch angle a with position 
{bottom left)^ and restoring force at the mirror point {right) 
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The first term on the right-hand side can be interpreted as the drift energy 
or the kinetic energy of the guiding center, while the second term fiB is the 
gyration energy. Since /i is invariant, an increase in B has to be compensated 
for by a decrease in the drift energy until uy becomes zero. At this mirror 
point the energy conservation yields 

nBmp = \mv^ = • (2-78) 

Thus, at the mirror point the drift energy entirely is transformed into gyration 
energy, which therefore is the particle’s total kinetic energy. The guiding 
center has come to a standstill and eventually will be reflected back towards 
the diverging field. The right panel in Fig. 2.5 illustrates the origin of the 
restoring force: at the mirror point the magnetic field is inhomogeneous. Thus, 
the plane of gyration is not perpendicular to the field and B has a component 
B± in this plane, leading to the restoring force 

Fr = qv X B = qv±B±€x . (2.79) 



The particle is pushed back into regions of decreasing field strength and its 
pitch angle decreases as gyration energy is transferred back into drift energy. 
Note that (2.79) also can be written as F = fi-VB which describes the force 
an inhomogeneous magnetic field exerts on a dipole magnetic moment. 

The location of the mirror point B^p depends on the initial pitch angle 
ai of the particle. If ai is zero, the magnetic moment is zero too, and the 
particle’s total kinetic energy is drift energy. An increase in the magnetic 
field strength does not transform drift energy into gyration energy and the 
particle traverses the magnetic mirror. If, on the other hand, ai is 90°, all the 
particle’s energy is contained in the gyration and the guiding center already is 
at a standstill. For values of ai in between these two extremes either reflection 
occurs at a point Xmp{(^i) or the particle is transmitted if the increase in the 
magnetic field strength is not sufficient to convert all the drift energy into 
gyration energy. Let us now determine whether a particle will be reflected or 
transmitted. The constancy of the magnetic moment implies that the ratio 
of the energy of gyration and the magnetic field strength is constant. Thus 
for any two points in the magnetic field we have 



mv\2 

2 Bi 2B2 



(2.80) 



or, taking the pitch angle into consideration, 

?;iSin^Q;i _ 0L2 

B\ B2 



(2.81) 



The kinetic energy is constant; thus it is vi = V 2 and the quantity 

2/i sin^ ai sin^ o;2 
mv^ Bi B2 



(2.82) 
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becomes an invariant of the motion. At the mirror point, the particle’s pitch 
angle ttnip is 90°. Thus reflection at the position B^p requires an initial pitch 
angle at B\ of 



. 2 r “^1 ^ 

sin ^1 = -^ = -^ 

-^mp ^mp 

Here R^p is the mirror ratio. Particles with an initial pitch angle a\ are 
reflected exactly at B^p] particles with larger ai are reflected earlier and 
particles with smaller ai pass through the mirror point. Thus (2.83) deflnes 
the boundary of a region in velocity space in the shape of a cone, called the 
loss cone (see Fig. 2.6): particles inside this cone are not conflned by the 
magnetic mirror. This loss cone is an important concept to describe the dy- 
namics of radiation belt particle populations. Note that the loss cone depends 
on pitch angle only, and does not depend on other particle parameters, such 
as mass, speed or charge. 



= arcsm . 



I B, 

-^mp 



= arcsm ^ 



R. 



. (2.83) 



mp 



Example 8. Assume an isotropic distribution (that is, the pitch angles are 
distributed uniformly) of 10 keV electrons injected on a held line at the 
equator, and Lq — b Earth radii from the Earth’s center. The magnetic held 
varies as 



H(Lo,^) 



Be \/l + 3 sin^ ^ 
Lq cos^ ^ 



(2.84) 



where ^ is the geomagnetic latitude and Be = 3.11 x 10“^ T is the equato- 
rial magnetic held strength at the surface. The equation of the magnetic 
field line is L = Lq cos^ From the conservation of the magnetic mo- 
ment, we can determine the number of particles reflected at geomagnetic 
latitudes of 30° and 60°. We need only the ratio between the magnetic field 
strengths at the injection site and at the reflection point. From (2.84) we 
obtain Brefl/Beq = V 1 + sin^ ^refl/ cos® ^refl, which gives a magnetic field 
ratio of 2.65 for ^ = 30° and 84.7 for ^ = 60°. From (2.83) we then find 
that all particles with a pitch angle larger than 38° and 6.3° are deflected at 
geomagnetic latitudes below 30° and 60°, respectively (that is, 48% and 83%, 
respectively, of the initial population). From the equation of the field line, 
we find that it intersects the surface of the Earth (L = 1) at a geomagnetic 
latitude of 63°. □ 



Example 9. Magnetic pumping is a process in which an adiabatic invariant 
is used to accelerate particles. To illustrate the idea, let us start from an 
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Fig. 2.6. Definition of the loss cone 
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isotropic plasma gyrating in a homogeneous magnetic field. The plasma tem- 
peratures and thus the particle energies parallel and perpendicular to the field 
are the same: Wkin,||,o = M^kin,±,o = Wq. Let us now increase the magnetic 
field slowly by a factor of two, such that the concept of adiabatic invariants 
can be applied but fast enough to prevent an exchange between parallel and 
perpendicular energy. The energy parallel to the field remains unchanged, but 
since the magnetic moment is conserved, the perpendicular kinetic energy in- 
creases as B increases: kFkin,j_ ^ B. If we wait for a sufficiently long time to 
allow temperature exchange between the parallel and perpendicular motions, 
we again have an isotropic plasma, now with fTkin,l|,i = W^kin,±,i = 1.5 Wq 
and thus Wkin,||,i = L51Tkin,||,o W^kin,±,i = L51Tkin,±,o- We can now allow 
the magnetic field to relax to its original value with the same speed as before. 
Again, the parallel kinetic energy remains constant while that of the motion 
perpendicular to the field is reduced by a factor of two: Wkin,||,i = 0-l'51Lkin,||,o 
and Wkin,±,i = L5Wkin,_L,o- The total energy is then W = 1.25Wo, that is 
the plasma has gained energy during this process, which can be repeated for 
further energy gain. □ 



2.4.3 Second Adiabatic Invariant: Longitudinal Invariant 



Two magnetic mirrors combined give a magnetic bottle (see Fig. 2.7): par- 
ticles are confined due to repeated reflection between the mirrors. This is a 
simple configuration in so far as the field is stationary, rotation-free and has 
a rotational symmetry around the field line. The second adiabatic invariant 
is related to the drift motion inside the bottle and the distance between the 
mirrors. The longitudinal invariant can be derived from (2.64) with the mo- 
mentum p 2 = mu|| and the distance s along the field as the spatial coordinate: 



S2 

J 2 = J ds = const 



With (2.77) this can be written as 



J2 = 




m 



(2.85) 



( 2 . 86 ) 




Fig. 2.7. Magnetic bottle 
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The second adiabatic invariant leads to an efficient mechanism for particle 
acceleration: for the acceleration of galactic cosmic rays, Fermi proposed a 
configuration of two converging magnetic mirrors (first-order Fermi effect 
[150, 151]). In terms of energy, the reflection of a particle at a fixed magnetic 
mirror is equivalent to a ball bouncing off a wall. Thus in a magnetic bottle 
with two fixed mirrors the particle oscillates between these mirrors without 
changes in total energy. The interaction with a moving magnetic mirror, 
however, is equivalent to the reflection off a moving wall. Depending on the 
relative speeds between particle and mirror, an energy gain or loss results: 
head-on collisions lead to an energy gain; if the particle and mirror move in 
the same direction, an energy loss results. According to (2.85) in each pair of 
collisions the total energy gain is determined by the shortening of the distance 
between the mirrors, independent of whether the particle meets both mirrors 
head-on or only one of them. A familiar equivalent is the warming of a gas 
during compression, e.g. inside a tire pump. 

The application of the second adiabatic invariant in the Earth’s magne- 
tosphere is not concerned with acceleration but with the asymmetry of the 
field. The dipole field in the inner magnetosphere forms magnetic bottles: 
particles moving from the equatorial regions towards the poles “see” a con- 
verging magnetic field, they move into a magnetic mirror (see Sect. 8.7.1). If 
their pitch angle is sufficiently large to prevent them from entering the loss 
cone, they eventually are reflected back, cross the equator and travel towards 
the other pole. This bouncing motion, however, is not exactly the same as in 
Fig. 2.7: the field fines are curved to follow the Earth’s dipole field and the 
particle therefore faces a stronger magnetic field during that part of the Lar- 
mor orbit closest to Earth than in the other half of the orbit. The resulting 
gradient drift leads to a guiding center motion around the Earth: electrons 
drift from west to east, protons from east to west, forming a ring current. 

In a symmetric magnetic field, the particles should return to a certain 
field fine after each drift period. But, as we shall see in Chap. 8, the Earth’s 
magnetosphere is neither symmetric nor constant in time. Why then should 
a particle return to a certain field line after one drift period? The particle’s 
energy is conserved during the motion. The first adiabatic invariant then re- 
quires that at the mirror point |S| is constant. If a particle has drifted back 
to a certain longitude, in principle it can be on a field line in an altitude 
different from its initial one. This is ruled out by the second adiabatic invari- 
ant, which determines the length of the field line between the mirror points. 
Combination of both invariants requires finding field fines with the same \B\ 
at the reflection point and the same length. But at a fixed longitude this is 
fulfilled by one field line only. Thus even in an asymmetric field the particle 
returns to its original field fine after each drift period. 
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2.4.4 Third Adiabatic Invariant: Flux Invariant 

The third adiabatic invariant, the flux invariant, is also related to the guiding 
center drift. It states that the magnetic flux enclosed by the drift orbit is 
constant. The particle moves on a surface which adjusts itself to variations 
in B so that the flux enclosed by this surface stays constant. Formally, the 
third adiabatic invariant can be derived from the action integral (2.64) by 
using the drift motion to deflne the generalized momentum p = mvD and -0 
as the azimuthal angle of the particles orbit with radius r: 

J 3 == y* mvBrd'ilj . (2.87) 

Similar to (2.65) we then obtain 

^ 47rm , , 

J 3 = -j-p M = const , (2.88) 

with M being the magnetic moment of the axisymmetric held. 

The third adiabatic invariant is also used in the Tokamak geometry of a 
fusion reactor [262,558,559] 



2.5 Summary 

This chapter has introduced the motion of individual charged particles in 
electromagnetic flelds. The particles are considered as test particles: they 
do not interact with other particles and their motion does not influence the 
flelds. The main results are: (i) The elementary motion of a charged particle 
in a homogeneous magnetic field is the gyration around the field line, char- 
acterized by the Larmor radius (2.32) and the cyclotron frequency (2.28). 
If the particle has a velocity component parallel to the field, a helical orbit 
results, (ii) The pitch angle a describes the relation between the particle’s 
motion parallel and perpendicular to the field, (hi) The motion of a charged 
particle can be separated into the motion of its guiding center and the gy- 
ration around the guiding center. The guiding center motion can consist of 
a field parallel motion and drifts, (iv) In slowly and weakly varying fields, 
three adiabatic invariants can be defined, each associated with a typical mode 
of motion: the first adiabatic invariant (constancy of the magnetic moment) 
is associated with the gyration, the second (longitudinal invariant) with the 
field parallel motion of the guiding center, and the third (flux invariant) with 
the drift of the guiding center. The adiabatic invariants can be applied to the 
motion of particles in the radiation belts. 
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Exercises and Problems 

2.1. Describe the concept of the guiding center. What is the reason for drifts? 

2 . 2 . What is an adiabatic invariant? Describe some examples. 

2 . 3 . Derive (2.52) and discuss the conditions under which it can be applied. 

2 . 4 . Derive an expression for the gyro-radius and frequency of a relativistic 
particle. The relativistic momentum isp = 7 mou, where 7 = -^1 — and 
mo is the rest mass. What is the expression for the magnetic moment of a 
relativistic particle? Show that the relativistic magnetic moment is conserved. 

2 . 5 . How is the magnetic moment defined? Give examples of magnetic mo- 
ments. Why is the magnetic moment an important physical quantity? 

2.6. Show that j' = j in the derivation of Ohm’s generalized law (2.14). 

2 . 7 . Derive Coulomb’s law from (2.2). 

2 . 8 . Why does (2.43) give a maximum Larmor radius? 

2 . 9 . Solve the equation of motion (2.27). 

2.10. Develop a simple (numerical) model for the depletion of the radiation 
belt. Start with the information from examples 5 and 8 , and assume losses 
to occur at a height of 1.05 Earth radii from the center of the Earth (about 
300 km height in the atmosphere) and that during each bounce period an 
amount n, with n being 50% of the number of particles lost, of the remaining 
particles are scattered into the loss cone. 

2.11. Determine the gyro-radii and frequencies for electrons and protons 

moving with thermal speeds (see Sect. 5.1.2) in the following fields: (a) the 
Earth’s magnetosphere, with Uq = rip = 10"^ cm“^, = Tp = 10^ K, 

B = 10~^ G; (b) the core of the Sun with = rip = 10^® cm“^, 
Tq = Tp = 10^'^ K, B = 10^ G; (c) the solar corona with rie = rip = 10^ cm“^, 
Tq = Tp = 10® K, H = 1 G; (d) the solar wind with rie = rip = 10 cm“^, 
Te - Tp - 10® K,B = 10~® G. 

2.12. A particle gyrates in a homogeneous magnetic field, (a) Determine the 
size of a volume V which contains an amount of magnetic energy equal to 
the particle’s kinetic energy, (b) Determine the height of a cylinder with this 
volume and a base given by the Larmor orbit, (c) Discuss this result. 

2 . 13 . In the equatorial plane, the Earth’s magnetic field can be described as 
B = Bo{RE/r)^ with Bq = 0.3 G, Re being the Earth’s radius, and r being 
the geocentric distance. Determine the time a particle with pitch angle 90° 
needs to drift around the Earth in the equatorial plane. What is the meaning 
of this time? Determine the period for electrons and protons with an energy 
of 1 keV drifting in a height of bvE above the center of the Earth. Compare 
with the drift due to the gravitational field and the period of an uncharged 
particle (e.g. a satellite) in the same orbit. 
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2.14. A proton of cosmic radiation is trapped between two magnetic mirrors 
with Rm = 5. Initially, it has an energy of 1 keV and v± = in the merid- 
ional plane between the two mirrors. Each mirror moves with = 10 km/s 
towards the other. Draw a sketch of the configuration. Determine the acceler- 
ation of the proton, (a) Does the acceleration continue until the mirrors are in 
contact with each other or does the particle escape? Determine the maximum 
energy acquired by the particle. Determine the maximum energy for other 
pitch angles, too. (b) How long does the particle need to acquire maximum 
energy? (Hint: assume the mirrors to be planes moving with speed Vm and 
show that the energy gain in each interaction is 2vui- How many interactions 
are required for the particle to acquire maximum speed?) 

2.15. The magnetic field of a magnetic mirror varies as == Hq( 1 -f- az^) 
along the axis, (a) At z == 0 an electron has a speed of = 3'yjj = 

Where does refiection occur? (b) Determine the motion of the guiding cen- 
ter. (c) Show that the motion is sinusoidal. Determine the frequency, (d) 
Determine the longitudinal invariant belonging to this motion. 

2.16. A particle of mass m and charge q is at rest in a uniform magnetic field 
B. At time t = 0, a uniform electric field perpendicular to B is switched on. 
Show that the maximum energy gain is 2m{EjBY . 

2.17. A solar proton with energy 1 MeV starts with an initial pitch angle 
of 85° at 2.5 solar radii. The interplanetary magnetic field decreases as r“^. 
Determine the proton’s pitch angle at the Earth’s orbit (213 solar radii) from 
the conservation of the magnetic moment. 

2.18. A 10 keV a-particle is trapped inside the radiation belt at a height of 
10^ km above the surface of the Earth in a magnetic field of about 10“^ T. 
Determine the drift speeds for the curvature and the gradient drift. Compare 
these speeds with the drift caused by the gravitational field. 

2.19. One model for particle acceleration in solar fiares uses the second adi- 
abatic invariant. A shock propagates outward through a magnetic field loop 
of sinusoidal form. Particles gyrate on this loop and bounce back and forth 
from the shock front. Develop a model for particle acceleration. 




3 Magnetohydrodynamics 



‘Glorious stirring sight!’ murmured Toad, never offering 
to move. ‘The poetry of motion! The real way to travel! 

The only way to travel!’ 
Here today-in next week tomorrow! Villages skipped, 
towns and cities jumped - always somebody else’s 
horizon! O bliss! O poop-poop! Oh my! Oh my! 
K. Grahame, The Wind in the Willows 



In the previous chapter we have discussed the motion of individual charged 
particles in prescribed E- and B-fields. Magnetohydrodynamics is different 
for two reasons: (a) it considers an ensemble of particles instead of just a single 
particle and (b) the E- and B-fields are not prescribed but determined by 
the positions and motions of these particles. Thus the field equations and the 
equation of motion have to be solved simultaneously and self-consistently: we 
are looking for a set of particle trajectories and field patterns such that the 
particles generate the field patterns as they move along their orbits and the 
field patterns force the particles to move in exactly these orbits. And all this 
has to be done in a time-varying situation. 

While magnetohydrodynamics describes many useful and important con- 
cepts, it is only a simplistic approach to plasma physics: it describes the 
plasma as a fluid with all particles having the same speed, the bulk speed. 
The thermal motion of particles is neglected. Kinetic theory (Chap. 5) also 
considers the velocity distribution of the particles. 

This chapter consists of four parts. It starts with a brief recapitulation 
of hydrodynamics and an introduction to the basic equations. Subsequently, 
magnetohydrostatics will be concerned with the energetics of the field and 
the particles without allowing for the collective motion of a plasma. Concepts 
such as magnetic pressure and magnetic tension are introduced. In magne- 
tohydrokinematics we shall discuss the reaction of the field to a fluid with a 
given velocity field. Basic concepts such as frozen-in fields and the dissipa- 
tion of fields are introduced. An application of these concepts is the merging 
of magnetic field lines, also called reconnection. In magnetohydrodynamics 
fields and particles can interact freely. In this chapter, the magnetohydrody- 
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namic dynamo will be discussed; magnetohydrodynamic waves will be treated 
in the next chapter together with other types of plasma waves. 



3.1 From Hydrodynamics to Magnetohydrodynamics 

In a gas or fluid, the motion of each individual particle is described by an 
equation of motion. If only electromagnetic forces act on a single particle, the 
equation of motion is given by (2.23). In a plasma, the equation of motion 
might be even more complex because the interaction between the particles 
has to be considered: there are not only external forces acting on the particle 
ensemble but also internal ones. In a plasma we would have to solve the 
equations of motion simultaneously for all particles, which might be billions 
inside a volume as small as 1 mm^. Such a task is impossible to complete. 
Instead, we can treat the plasma as a fluid: we are no longer interested in 
the motion of individual particles but only in the motion of a fluid element 
or the fluid as a whole. So, to understand magnetohydrodynamics, a sound 
knowledge of hydrodynamics is helpful. 

This section recapitulates the basics of hydrodynamics: partial and con- 
vective derivatives, the pressure-gradient force, and the momentum balance 
in different forms, such as Euler’s equation or the Navier-Stokes equation. 
The equations of continuity and state are also recapitulated. 



3.1.1 Partial and Convective Derivatives 

The equation of motion for a particle, F = dp/dt, contains a total derivative 
of the momentum and the external forces acting on the particle. In a fluid, in 
principle, we can use the same approach: single out a volume element, follow 
its path, and calculate the local forces acting on the moving volume. This 
corresponds to Lagrange’s description of particle motion. Here we simply 
would have to multiply the transport equation by the number density n of 
the particles and obtain 



du , 

nm— = nqiE u X B) , 



(3.1) 



with u = {v) being the bulk velocity or average velocity of the particles.^ 



^ The bulk velocity gives the velocity with which the fluid element moves. If the 
thermal motion is ignored, all particles move with the bulk velocity. If the thermal 
motion is considered, each individual particle moves with the sum of its thermal 
velocity Vth and the bulk velocity u: Vp — vth + u. Averaged over all particles in 
the fluid element, the individual particle velocities give the bulk velocity (vp) = u 
because (t;th) = 0. 
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For practical purposes, we normally consider a volume fixed in space and 
measure the properties of the fiuid streaming through the volume. A prop- 
erty e of the fiuid then is given as ^ = s{x^y,z^t) with the spatial coor- 
dinates and the time being independent variables. This corresponds to Eu- 
ler’s description of a fiuid. In contrast, in Lagrange’s description, the spa- 
tial coordinate depends on time too, and a property e of the fiuid is given 
as £ = £{x{t),y{t)^z{t)^t). In the atmosphere, Euler’s description could be 
applied to a stationary thermometer while Lagrange’s description could be 
applied to a thermometer on a radio-sonde carried by the prevailing winds. 

If we are interested in changes in e, we have to calculate its derivative. 
In Euler’s description, the total derivative de/dt and the partial derivative 
de/dt are equal because all temporal derivatives of the spatial coordinates 
vanish. In Lagrange’s description, the chain rule has to be applied: 



de 

dt 



dx de 
dt dx 



dy de dz de de , de 



( 3 . 2 ) 



The change of a property e in a moving fluid element therefore consists of 
two parts: (a) a change in e at a fixed position in space (second term on the 
right-hand side); and (b) the relative motion between the observer and the 
medium (first term). Or, more formally: the total temporal derivative consists 
of a local temporal derivative and advection; it is also called the convective 
derivative. Note that the product (n • V) is a scalar differential operator. 
Occasionally, the total derivative is written as D/Dt instead of d/dt. 

To understand the difference between a convective and a local temporal 
derivative let us take a look at the property of water as, for example, salinity 
or temperature. Let us first consider a closed volume, e.g. a fish-pond. The 
temperature then might change due to absorbed solar radiation, and the 
salinity might change due to evaporation. These changes are local temporal 
changes. Now think of this volume of water as a segment of a river. The local 
changes are still the same but there are also changes due to the advection 
of water from other sites: warmer water might be advected into the volume 
from a power station upstream or the salinity might increase as the incoming 
tide carries water with higher salinity into the volume. 



3.1.2 Equation of Motion or Momentum Balance 

The motion of a fluid element in hydrodynamics can be described by Eu- 
ler’s equation or the Navier-Stokes equation. All these different equations of 
motion have one basic ingredient, the pressure-gradient force. In single-body 
motions, only external forces act on the body. In a fluid, on the other hand, 
regions of different pressure, for instance related to temperature differences, 
can exist, exerting forces on fluid elements. Thus, before inserting the ex- 
ternal forces into the equation of motion, let us have a look at this internal 
force, the pressure-gradient force. 
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Az 



Fig. 3.1. Normal forces p and shear stresses r acting 
on a cubic volume 



Pressure-Gradient Force. Regions of different pressure in a gas exert 
forces: particles move from the high pressure towards the low one. This force is 
proportional to the pressure gradient — Vp and is called the pressure-gradient 
force. Here we give its derivation, closely following Chen [97]. 

Pressure is related to the thermal motion of particles. The pressure- 
gradient force leads to a transport in momentum resulting from the motion 
of particles in and out of a fluid element V\xq = AxAyAz at position xq (see 
Fig. 3.1). If the random thermal motion is limited to the x-axis, particles en- 
ter and leave the volume through surfaces A and B only. The fluid particles 
are characterized by their mass m, their speed v and their number density 
n. During a time interval 



An = An^; VxAyAz 



(3.3) 



paxticles with speed Vx pass through surface A with area A = AyAz into the 
positive x-direction. Here 



Ariv = Avx 




Vy,Vz)dVydv^ 



(3.4) 



is the number density of particles with speed Vx , with / being the distribution 
function (Chap. 5). 

Each particle carries a momentum mvx- The total momentum trans- 
ported through A into the positive x-direction then is 



P+ = J2 AnymvlAyAz = AyAz . (3.5) 

Here the sum over An^ is expressed by the average {v^) of the distribution 
times the particle number density. The factor 1/2 indicates that only half 
of the particles in the adjacent volume element V\xq-/^x at xq — Ax have a 
speed in the positive x-direction and transport momentum through A into 
V\xq- But particles inside V\xq also have a momentum in the positive x- 
direction which is carried out of the volume through the surface B. Their 
number is given as 

P+ = AyAz . 



(3.6) 
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Therefore, the net gain of the positive x-momentum in F|xo is 

- P+ = . (3.7) 

Particles moving into the negative x-direction double the momentum gain 
in (3.7) because the negative x-momentum is transported into the negative 
x-direction: 

3 3 

— (nm^;^)AxA2/Az = -m—{n{vl))iS.x/S.y/S.z . (3.8) 

The particle speed Vx = Ux Vx^^ consists of two parts, the bulk speed Ux 
of the fluid element with Ux = {vx) and the superimposed thermal speed 
fxth with {vx^^) = 0. The latter is described by a one-dimensional Maxwell 
distribution (Sect. 5.1.2). The relationship between average thermal speed 
and temperature is: 

. (3.9) 

With (3.8) we obtain 

3 3 

-^{nmu:,) = -m— [n{{ul) + 2{u^v^J + (v^^J)] . (3.10) 

The last term on the right-hand side can be substituted by (3.9). The term 
in the middle is zero because Ux is constant and thus (u^Uxth) = = 0 

(see Sect. 4.1.3): 



d d 

at*"™"*) = 



nu\ + 



nk^T 



). 



m / 



(3.11) 



The partial differentiation on the right-hand side with nu^ = nux Ux gives 

3{nkT) 



3ux 3n 

mn— + mux-^ = -mUx 



3(nux) 3ux 

— mnux^— 

ox ox 



3x 



(3.12) 



The second term on the left-hand side and the first term on the right-hand 
side cancel (see the equation of continuity (3.34)). With the pressure p defined 
a.s p = nk^T rearrangement leads to 



/ 3ux 3ux 



mn- 



dux 

dt 



3p 

3x 



(3.13) 



Generalization to three dimensions gives the pressure-gradient force density 



mn 



3u 

3t 



+ ix(V • u) 



du ^ 

= mn— = -Vp . 



(3.14) 



Since n is a number density (unit m“^), the product nm gives the density 
Q and we can write alternatively for the acceleration due to the pressure 
gradient force 



du 




(3.15) 
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Equation of Motion: Euler and Navier— Stokes. The simplest equation 
of motion for a fluid considers the acceleration due to the pressure-gradient 
force and gravitation 

^ = + (3.16) 

This equation is known as Euler’s equation and often is used for simple es- 
timates in atmospheric or oceanic motion. Euler’s equation can be applied 
to ideal fluids only. In a real fluid, viscous forces have to be considered too. 
Here the Navier-Stokes equation is useful: 

^ = + (3.17) 

dc g 

with u being the kinematic viscosity. Often, other forces, depending on the 
situation under study, are added to this equation. Some of these forces will be 
discussed below where we also shall have a closer look at the viscous forces. 

Stress Tensor and Viscosity. In the generalization of (3.13) we tacitly 
assumed that Xi-momentum is transported in x^-direction only and that the 
fluid is isotropic. This is true in an ideal gas or fluid but not in a viscous one, 
where momentum can be transported in directions perpendicular to the par- 
ticle motion, and momentum transport is not necessarily isotropic. Then the 
scalar property p has to be replaced by a tensor P, and the pressure-gradient 
force Vp has to be replaced by VP. P not only considers the pressure, which is 
orthogonal to the surface of a volume element, but also shear stresses, which 
are forces parallel to the element’s surface (see Fig. 3.1). The stress tensor P 
has the dimensions of a pressure or an energy density. It is symmetric with 
six independent components Pij for each point: Pij = mnviVj\ i being the 
direction of the momentum transport and j the component of the momentum 
involved. A more compact method to write the stress tensor is 

P = mn{vthVth) ■ (3.18) 

Here is not a shorthand for a scalar product but the tensor product 

(dyad) of two vectors: such tensor products ab of two vectors are tensors T, 
where 

( CLx \ / bx \ / (^xbx ^xby Ojxbz \ 

I I I — I ^ybx ^yby (^ybz 1 • (3.19) 

dz j \bz J \dzbx CLzhy CLzbz j 

In the simplest case, the particle distribution is an isotropic Maxwellian 
and the stress tensor P can be written as 



P = 



p 0 0\ 

0 p 0 = pE , 

0 0 p/ 



(3.20) 
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where E is the unit tensor. Here VP equals Vp. In the presence of a magnetic 
field, a plasma can have two different temperatures Ty and T±_ parallel and 
perpendicular to the magnetic field, leading to different pressures py = nk^Tw 
and px = nk^T^. In a coordinate system oriented with its z-axis parallel to 
H, the stress tensor can be written as 

/pj. 0 0 \ 

P = 0 Pi 0 . (3.21) 

V 0 0 p|| / 

This tensor is diagonal and it is isotropic in a plane perpendicular to B. 

The off-diagonal elements of the stress tensor in an ordinary fiuid are as- 
sociated with viscosity. Viscosity results from collisions between particles and 
tends to make the flow more uniform. Quantitatively, the effect of viscosity 
is described by a kinematic viscosity coefficient u = Vth^ where t’th is the 
thermal speed and A the mean free path between collisions. Alternatively, a 
viscosity coefficient r] = ug can be used. In a fluid, friction is described by 

/frict = + i7?V(V X u) . (3.22) 

In an incompressible fluid, the second term on the right-hand side vanishes: 

/frict = . (3.23) 

This can be interpreted as the collisional part of VP — Vp. Note that the 
inclusion of viscosity into the momentum balance has two consequences: (a) 
in agreement with the irreversible character of the transport process, the 
transport equation is no longer time-reversible: if ix(r, t) is a solution of the 
transport equation, then u(r, —t) is not. (b) Viscosity increases the order of 
the partial differential equation. Therefore, to determine solutions we need 
more boundary conditions than in the case of a non- viscous fluid. 

In a plasma, off-diagonal elements can arise without collisions: gyration 
brings particles into different parts of the plasma, a process which tends to 
equalize the fluid speeds. The scale of this “collisionless viscosity” is given by 
the Larmor radius rather than by the particle mean free path. 

Fictitious Forces in Rotating Systems. The forces discussed so far are 
sufficient to give the equation of motion for a plasma in the laboratory setting. 
In large-scale natural plasmas, such as the ionosphere or stellar atmospheres, 
additional forces act: the Coriolis force and the centrifugal force. 

Consider two frames of reference C and C', with C rotating with an an- 
gular velocity 17 with respect to C'. A vector r fixed in C, in C' moves with 
a speed 17 x r. The temporal derivative of r in C' s 

+17xr or = v f2 x r . (3.24) 
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The temporal derivative gives the acceleration in the rotating frame: 



a = 




d^i?^ di?^ di? 

— - — = — — \-f2xv' = — + 2f2xv + f2x{f2xr). (3.25) 



Thus the density of the fictitious forces in a rotating frame of reference is 

/rot = —^2 17xt; — ^I7x(17xr) (3.26) 



with the first term on the right-hand side describing the Coriolis force and 
the second term the centrifugal force. 

In the near-Earth environment the Coriolis force has to be considered in 
the atmospheric motion and in the ionospheric and magnetospheric current 
systems; it is of vital importance in the dynamo process inside the Sun and 
the planets. The infiuence of the Coriolis force can be illustrated by its effect 
on the atmospheric motion. In the northern hemisphere, wind is deflected 
towards the right. On a global scale, this deflection leads to the break-up 
of the Hadley cell driven by the temperature gradient between the equator 
and the pole into three separate cells, which determine the global atmospheric 
circulation and govern the energy transport from equator to pole. The Coriolis 
force, and therefore the size of the deflection, depends on the wind speed: 
with increasing speed, the distance travelled by a volume of air during a 
time interval increases. A longer trajectory also means a larger displacement. 
The Coriolis force becomes effective only if the scales of the system are large 
enough. Contrary to popular belief, the eddy at the outflow of a bath-tub is 
not due to the Coriolis force: its direction depends on residual motions in the 
water or the motion induced by pulling the plug. 



Electromagnetic Forces. A charged particle in an electromagnetic field 
experiences the Lorentz force (2.23). With n being the number density, the 
force on a volume element can then be written as 



mn 



du 

dt 



= mn 



du 



-h {u • V)u 



= qn{E d-u X B) 



(3.27) 



The dimension of n is m thus (3.27) can also be written as a force density 



du 

Jelmag — ~ ^ 



du . 

-^ + (u.V)» 



= QcE d-j X B , 



(3.28) 



with g = mn being the density, the charge density, and j = nqu the 

current density. Equation (3.28) gives the force density of the electromagnetic 
field. For infinite conductivity, the charges immediately rearrange and cancel 
out the electric field. The force density then reduces to 



/elmag — j ^ B . 



(3.29) 
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Putting it all Together. Adding these forces gives the equation of motion 
or momentum balance: 



du 



= Q 



du 

-VP -f qE + j X ^ — 2gf2 x u — gf2 x (17 x r) . (3.30) 



+ {u • V)u 



If we neglect the electric field and the fictitious forces and split the stress 
tensor into the pressure-gradient force and friction, (3.30) can be written as 



du 



du 



-f {u • V)u 



= — Vp -f guV^u -f j X B gg . 



(3.31) 



This equation is the Navier-Stokes equation used in hydrodynamics comple- 
mented by the forces exerted by the electromagnetic field. 

The momentum balance (3.30) still is relatively simple: (a) it does not 
consider sources and sinks, e.g. due to ionization or recombination, which 
might involve a net gain or loss of momentum; (b) it does not consider mo- 
mentum transport due to Coulomb collisions between charged particles; and 
(c) it does not consider momentum transport arising from the forces exerted 
by a particle component of opposite charge inside the plasma. The latter will 
be discussed briefiy in the two-fiuid description of a plasma (see Sect. 3.2.1). 



3.1.3 Equation of Continuity 

An equation of continuity is concerned with the conservation of a property 
such as mass or charge. A change in e inside a volume V can result from the 
convergence of a fiux C{e) into or out of the volume or sources and sinks S{e) 
inside the volume. The general form of an equation of continuity therefore is 

§ + VC{e) = S(e) . (3.32) 

The most common application is the conservation of mass: 

^ = -V(eu) = -Vi , (3.33) 

where g is the density and j = gu is the mass current density. Using (3.2) 
the conservation of mass can be rewritten as 

^ = = (3.34) 

It states that a change of mass inside a volume is a consequence of the fiow of 
matter into or out of the volume. Local sources and sinks are not considered 
because, except for elementary particle physics, there are none. With Gauss’s 
theorem (A. 33) the integral form of the equation of continuity is 
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^ J QdV = - ^ j do . (3.35) 

V 0(V) 

The equation of continuity for the electric charge is formally analogous, with 
Qc replacing g in (3.33): 






(3.36) 



3.1.4 Equation of State 

Finally, we need a relationship connecting the scalar pressure p and the den- 
sity g: p = p{g^T). The equation of state describes how the temperature 
changes during the motion or compression of a gas. In case of an isothermal 
ideal gas, the equation of state can be written as 

P = C{T)q, (3.37) 

where C is a constant proportional to temperature. If the compression is 
slow compared with thermal conduction (isothermal compression), the pres- 
sure increase results from the density increase but not from the temperature 
increase. In a plasma particles can freely flow along B. Thus conduction 
parallel to B provides the possibility for a plasma to remain isothermal, 
especially if the compression is periodic or wave-like along B. 

A fast moving gas might not be able to exchange energy with its environ- 
ment. The equation of state for such an adiabatic compression is 

p = Cg^^ , (3.38) 

where 7a = Cp/cy is the speciflc heat ratio or adiabatic exponent. For an ideal 
gas, 7a equals {N 4- 2)/N, with N being the number of degrees of freedom. 
For a three-dimensional ideal gas consisting of atoms, 7a is 5/3. Both cases, 
isothermal as well as adiabatic compression, are of importance in different 
types of plasma waves. 

A third important case arises if adiabatic compression is fast compared 
with heat conduction and also is anisotropic. Now the degrees of freedom 
parallel and perpendicular to the held are separated, and T|| (AT == 1, 7a = 3) 
can be heated more efficiently than Tj_ {N == 2,7a = 2). The adiabatic 
invariants can be used to derive generalizations of this relationship. 

The perpendicular pressure p±_ can be expressed by the average magnetic 
moment /x: p±_ — \g{v\) = n{p)B. If the compression is fast compared 
with the heat conduction but slow compared with the gyration period, the 
magnetic moment is conserved, leading to the adiabatic relation 




3.2 Basic Equations of MHD 



57 



Pure perpendicular compression in general is equivalent to an increase in the 
magnetic field strength. For each area A, the conservation of particles implies 
nA = const while conservation of the magnetic fiux yields BA = const. Thus 
n/B is constant, too, and (3.39) reduces to p = with 7a being 2 as 
expected for two-dimensional adiabatic compression. 

The pressure py = Q{vp parallel to the magnetic field is related to the 
second adiabatic invariant J 2 v^\L = const, with L being the scale length 
along the magnetic field. If the compression is slow compared with the par- 
ticle’s oscillation along the field line, then J 2 is conserved. Now length L, 
area A and volume V = AL change. The conservation of particles and the 
magnetic fiux yield nV = const and BA = const. Thus L can be expressed 
as L = VfA^Bjn and we finally get 



dt\n^ 



(3.40) 



Pure parallel compression with B = const then leads to p = Cg^^ with 7 a = 3 
as expected for one-dimensional compression. 

The two adiabatic relations (3.39) and (3.40) are called the “double adi- 
abatic” equations of state. 



3.2 Basic Equations of MHD 

We shall start with the one-fiuid description of a plasma, i.e. the fluid consists 
of one particle species only. This is entirely sufficient to introduce the basic 
concepts (see Sects. 3. 3-4.2). In a real plasma, quasi-neutrality suggests the 
existence of two fiuids with positive and negative charges, respectively. For 
certain phenomena, such as ion waves, a description in the framework of a 
two-fiuid theory will be required, and this is briefly sketched in Sect. 3.2.1. 

In magnetohydrodynamics some assumptions about the properties of the 
system are made: (a) The medium can be neither polarized nor magnetized: 
e = // = 0. (b) Flow speeds and speeds of changes in field properties are small 
compared with the speed of light: u/c <C 1 and t’ph/c <C 1. As a consequence, 
electromagnetic waves cannot be treated in the framework of MHD theory, 
(c) Conductivity is high, thus strong electric fields are immediately cancelled 
out: E'/B <C 1. As a consequence, the displacement current dE/dt can be 
ignored compared with the induction current. MHD is a theory linear in u/c, 
t^ph/c, and E/B and ignores all terms of higher order in these quantities. 
MHD considers the conservation laws of fluid mechanics which are concerned 
with mass, momentum, energy, and magnetic fiux. The formal description is 
then based on the following set of equations: 

• Maxwell’s equations (Sect. 2.1.1): 



V • E — gc/so , 



(3.41) 
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VB = 0, 

V, ^ dB 

' 

V X B = |ioj ; 

• Ohm’s law (Sect. 2.1.3) 

j = a{E + ux B) ; 

• equation of continuity (Sect. 3.1.3) 

OQc 



dt 



+ V{uQc) = 0 ; 



• equation of motion (momentum balance, Sect. 3.1.2) 

+ g{u ■ V)ti = -Vp + j X B + gg + gi/V^u ; 
at 

• equation of state (Sect. 3.1.4) 



(3.42) 

(3.43) 

(3.44) 

(3.45) 

(3.46) 

(3.47) 



This set of partial non-linear differential equations can be solved for given 
boundary conditions. For certain applications only a part of the equations is 
required, or some equations can be used in a simplified form: in magnetohy- 
drostatics (Sect. 3.3) the left-hand side of the momentum balance vanishes 
while in magnetohydrokinematics (Sect. 3.4) an external velocity field is pre- 
scribed and therefore the momentum balance can be ignored completely. 

The momentum balance gives us hints on the kind of motion: in certain 
slow motions the inertial term gii can be ignored while in weak magnetic 
fields the Lorentz force can be ignored. The relative strength of these two 
forces is determined by the ratio 

5^/2/io _ magnetic field energy density 
gu ‘^/2 kinetic energy density v • ; 

For 5 > 1 the magnetic field determines the motion of the particles and the 
single-particle approach can be used. For 5 <C 1, the magnetic field is swept 
away by the plasma motion, in accordance with the concept of the frozen- 
in field described in Sect. 3.4.1. S is another expression for the plasma-/?, 
giving the ratio between the gas dynamic pressure and the magnetic pressure: 
(3 = 2fiop/B\ 

It should be noted that these two definitions are useful only for an isotropic 
plasma. If the plasma is anisotropic, frequently a parallel and a perpendicular 
plasma-/? are defined as 
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p 2/ioP|| , ^ 2/ioPj. 

I^W = ■ (^-50) 

In a low-/? plasma (^3 <C 1), the energy density in the thermal motion is 
much larger that in the magnetic field, while in a high-/? plasma (/? ^ 1) the 
opposite is true. 



3.2.1 Two-Fluid Description 



So far, we have treated the plasma as a fluid consisting of one kind of 
charged particles only. A real plasma, however, contains electrons, ions, and 
possibly also neutral particles. Each particle component has its own speed, 
temperature, and partial pressure. 

Since a plasma is expected to be quasi-neutral, the number of positive and 
negative charges has to be equal. The charge density is Qc = n[qi -f- rieQe = 
+ Qe with Hi and rie being the number densities of ions and electrons with 
charges q[ and qe- The current density is j = riiqiUi -f rieqeUe = ji + je- If 
we limit ourselves to a two-fluid plasma, we have to deal with an electron 
and an ion component; the neutral component is ignored. In addition to the 
assumptions made in the one-fluid description we assume: (a) the fluid is in 
thermal equilibrium {T[ = Te), and (b) the plasma is quasi-neutral (^i = ^e)- 
The basic equations in two-fluid MHD are 



Maxwell’s equations 



V • JE — (^i + Qe)/ eo , 

V-B = 0, 

dB 



V X E = -■ 



dt ’ 



Ohm’s law 



dE 

V X B = floUi + je) + ; 



dj „ ^ j X B Vpe j 

=E^uxB-~ 

e^n ot en en a 



equation of continuity 

drij 

dt 



V-{rijUj)=0, i = i,e 



momentum balance (equation of motion) 



(3.51) 

(3.52) 

(3.53) 

(3.54) 

(3.55) 



(3.56) 



duj 
^ ^ dt 



= qjTij {E + Uj X B) - Vpj ± (3{ui - Ue) , j = i,e; 



(3.57) 



Pj =Pj{Qj,Tj) , j = i,e. 



• equation of state 



(3.58) 
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Compared with the equations in one-fluid MHD we And the following differ- 
ences: (a) The equations of state, motion and continuity are given for each 
component separately, (b) The equation of motion contains an additional 
term coupling the two components to consider momentum transfer arising 
from Coulomb collisions. The force between the two components depends on 
their relative speed, therefore f { = —fe = /3{ui — Ue). (c) Gauss’s law for the 
electric field contains both charge densities as Ampere’s law contains both 
current densities, (d) Ohm’s law has become unrecognizable. A derivation 
of Ohm’s law from the equation of motion can be found in [285]; here only 
the terms will be explained. The left-hand side gives the current accelera- 
tion. The first, second and last terms on the right-hand side are expressions 
already known from Ohm’s law in one-fluid MHD. The j x B term is called 
the Hall term and describes the Hall effect: in a magnetic field the current 
created by the moving charges is deflected by the Lorentz force, resulting 
in an additional electric field perpendicular to both j and B. The fourth 
term on the right-hand side gives the pressure diffusion: in the presence of 
a pressure gradient, both particle species diffuse with respect to each other, 
creating a current along Vp. 



3.3 Magnetohydrostatics 

Magnetohydrostatics deals with the energetics of particles and fields. It does 
not require the entire set of MHD equations; instead, the field equations and 
the equation of motion (with vanishing inertial term) are sufficient. Important 
concepts are magnetic pressure and magnetic tension. 

3.3.1 Magnetic Pressure 

Let us now take a closer look at a magnetic field such as shown in Fig. 3.2. 
The lines of force are parallel to the z-axis with the field strength varying 
along the x-axis: B = (0,0, 5(x)). The force density exerted by the field is 
f = j X B. With j expressed by Ampere’s law (3.44) we obtain: 




Fig. 3.2. Magnetic pressure: field gradient perpendicular to the field {left) and the 
resulting spatial distribution of the field strength (right) ^ with arrows indicating 
the direction of the magnetic pressure 
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f=^jxB = {VxB)x B/fio . (3.59) 

For a general derivation of the magnetic pressure and tension, we can use 
(A. 20) and (2.19) and obtain 

S X (V X JB) = -1 V(BB) + - B(V • B) 

Ho Ho Ho 

= -1 V{BB) + ^VB^ . (3.60) 

/io 

The first term on the right-hand side gives the force density arising from the 
magnetic stress tensor BB which describes magnetic tension and torsion. The 
second term is formally equivalent to the pressure-gradient force, but instead 
of gas pressure p a magnetic pressure 5^/(2/io) is used. Both the magnetic 
pressure and the magnetic tension can also be derived more graphically from 
(3.59) if we use simplified geometries. 

For the field defined above, (3.59) yields 





(3.61) 


Thus, the force density only has a component along the x-axis 
generally, perpendicular to B and parallel to the field gradient): 


(or, more 


_ _ d 

^ /io dx dx 2/io 


(3.62) 


Therefore, an inhomogeneity in the magnetic field gives rise to a force den- 
sity pushing field lines back from regions of high density into low density 
areas. Such behavior is well known from an isothermal gas where a restoring 
force / ~ Vp tries to cancel out pressure gradients. Therefore, (3.62) can be 


interpreted as the magnetic pressure: 




Pm = 7^ — ■ 
2ho 


(3.63) 



Graphically, this magnetic pressure can be described as the tendency of 
neighboring field lines to repulse each other. Note that, in contrast to the 
gas-dynamic pressure, the magnetic pressure is not isotropic but is always 
perpendicular to the field. 

The analogy with gas-dynamic pressure can be pushed even further if 
we invoke the concept of frozen-in magnetic fields (Sect. 3.4.1). Imagine a 
magnetic field frozen-into a plasma: each plasma parcel contains a certain 
amount of magnetic fiux which is tied to this plasma element and follows its 
path as the plasma parcels are shufiled around. Thus a field gradient always 
has to be combined with a gradient in gas-dynamic pressure. As the plasma 
attempts to reduce the pressure gradient, the field will be homogenized, too. 

Formally, the magnetic pressure also could be inferred from Maxwell’s 
stress tensor, as is shown in [36]. 
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Fig. 3.3. Model of a sunspot: the gas-dynamic 
pressure from the outside is balanced by the mag- 
netic pressure inside the sunspot 



Example 10. A homogeneous magnetic field of 5 T, according to (3.63), exerts 
a magnetic pressure p = J5^/2//o == 9.95 x 10® N/m^ = 995 hPa x 100, which 
is a hundred times the atmospheric pressure at sea level. □ 



Example 11. Sunspots are a prime example of the apparently paradoxical 
behavior of plasmas as well as a good illustration of the concept of magnetic 
pressure. Sunspots (Sect. 6.6, Fig. 6.26) are cool and dark patches on the 
visible solar disk. Temperatures in sunspots are 1000 K to 2000 K below the 
temperature of the ambient photosphere (5700 K). Despite this temperature 
gradient, the sunspot does not mix with the ambient plasma. Instead, it is 
very stable and can survive for many solar rotations. This longevity results 
from the magnetic pressure: inside the sunspot the magnetic field is about 
3000 gauss compared to a few gauss at the outside. The boundaries of a 
sunspot are sharp in both magnetic field strength and temperature. 

Figure 3.3 shows a simple model of a sunspot. The indices “i” and “o” refer 
to plasma and field properties inside and outside of the sunspot, respectively. 
To prevent hot photospheric material from streaming into the sunspot, the 
gas-dynamic pressure Po of the photospheric plasma has to be balanced by the 
combined magnetic and gas-dynamic pressure inside the spot: pid-5?/(2/io) = 
Pq. Here we have assumed that the magnetic field pressure outside the sunspot 
is negligible and that dBjdz is zero, as is suggested by observations. The 
variation of pressure with height is described by the hydrostatic equation: 
dpjdz = p^'sun- Since B is independent of height, this yields dp\jdz — dpojdz 
and Qi = Qo- On the other hand, the universal gas law yields for the pressure 
p = ghsT/m. Because the densities inside and outside the sunspot are equal, 
the gas law requires T[ to be smaller than To to fulfill p[ < Po- Thus, in 
agreement with the observations, we find: for longevity, the higher magnetic 
field inside the sunspot has to be combined with a lower temperature and, 
consequently, less electromagnetic emission. □ 
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3.3.2 Magnetic Tension 

Let us now have a closer look at a simple interpretation of the first term in 
(3.60) which is concerned with magnetic tension. The upper panel of Fig. 3.4 
shows a homogeneous magnetic field parallel to the x-axis: Bq = (5o,0,0, ). 
The field is assumed to be frozen-into a plasma. The plasma motion u = 
(0, 0, Uz{x)) (middle panel) leads to the deformation of the field shown in the 
lower panel. The distorted field can be described as the superposition of the 
undisturbed field Bq and a disturbance SB (see (3.94) in Sect. 3.4.2): 

dB 

B = Bq H — ~ + V X (ix X B)dt . (3.64) 

With (3.59) and the field described above we find a force density parallel to 
the disturbing velocity field 

The force, called the magnetic tension, always is a restoring force: if the field 
lines have a convex curvature into the upward direction, d^Uzfdx"^ is less than 
zero, leading to a force directed downwards. If the curvature is opposite, the 
force also is in the opposite direction. The magnetic tension can also be 
interpreted graphically: magnetic field lines have a tendency to shorten. 

Example 12. Again, consider a 5 T magnetic field. It is disturbed by a si- 
nusoidal velocity field v = vq sinkx, where vq = 1 m/s and k = b m~^, 
acting for St = 1 ps. To determine the force density, we need the second 
derivative of the velocity: v” = —vok‘^ sinkx. With (3.65), we then obtain 
f = 20 N/m^ sin((5/m)x) and thus for x = 0 m, / == 0 m, because here 
the magnetic field is not displaced from its original position and no restoring 
force acts on it; for x = 2.5 m we obtain 14 N/m^, and for x = 5 m (maximum 
displacement), / = 20 N/m^. □ 




Fig. 3.4. Magnetic tension: the lines of 
force in a homogeneous magnetic field {upper 
panel) are distorted by a velocity field {mid- 
dle panel), giving rise to a restoring force in- 
dicated by the arrows {lower panel) 
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Example 13. Solar Filaments: Another example of the unusual behavior of a 
plasma are solar filaments or protuberances which are cold and dense mat- 
ter suspended from magnetic arcades high above the photosphere. Such a 
structure is called a filament as long as it is seen as a dark (because cold) 
stripe in front of the photosphere. As the Sun rotates, this structure becomes 
visible as a bright (because dense) arc extending high above the photosphere: 
a protuberance. The spatial structure clearly becomes visible: the filament 
extends vertically above the photosphere, covering angular distances of some 
10 degrees. Filaments are stable and can last for a few solar rotations. Un- 
der certain conditions they become unstable and are blown out violently as 
coronal mass ejections (Sect. 6.6). Typical temperatures are about 7000 K 
(ambient corona: 10^ K). The density is about 100 times larger than the am- 
bient density; the typical vertical extension is up to 30 Mm, that is about 
100 times the scale height in the corona. 

The first theoretical description of a filament goes back to Kippenhahn 
and Schliiter [286]. Here we shall limit ourselves to a much shorter, more 
general discussion. Filaments are roughly aligned along the neutral line be- 
tween regions of opposing magnetic fields (Sect. 6.7). Thus the magnetic field 
seems to play an important role in the existence as well as stability of the 
filament. Figure 3.5 sketches the situation: the filament (thick vertical line) 
is supported by magnetic arcades connecting opposite polarities in the pho- 
tosphere. The magnetic field lines do not form perfect arcades, instead they 
are ditched-in at the position of the filament: gravity pulls down the filament 
which in turn pulls down the frozen-in magnetic field. The deformation of the 
magnetic field causes magnetic tension in the opposite direction. Thus the 
filament is held at a certain height by an equilibrium between gravity and 
magnetic tension. 

We can obtain a more quantitative statement from the basic MHD equa- 
tions. Here we need the equation of motion in the stationary case. 



Vp= — {V X B) y. B gg , 



(3.66) 



and the equation of state p = n/cuT, with T being spatially constant. 

Following [285] let us define a coordinate system with the x^-plane tan- 
gential to the surface of the photosphere and the ^-direction extending along 
the filament into the drawing plane. The 2 :-direction points upward; thus 
Fig. 3.5 is a cut through the filament in the xz-plane. All quantities are as- 
sumed to be independent of y, thus d/dy is zero. The magnetic field is given 
as B = {Bx,0, By) and g is (0,0,-g). The double cross product in the equation 
of motion then can be written as 



(V X B) X B = (^ - ^) (B„0,B.) . (3.67) 



Combination of the equation of motion and the equation of state gives 




3.3 Magnetohydrostatics 



65 



and 



, _dn 1 _ 
ksT— = 
ox fio 



kBT^ = -—B, 
OZ /io 



dB^ dB, 



dz dx 

dB^ _ 
dz dx 



mng . 



(3.68) 



(3.69) 



Differentiation of (3.68) to z and of (3.69) to x and subtraction of these 
equations yields 



dB, dB, 

dz dx 



dBx _ dBx 
dz dx 



+B + 

^ \dzdx ) ‘ 

Gauss’s law for a magnetic field in the two-dimensional case gives 



dp 



(3.70) 



^ = d^B^ _ d^B, 

dxdz dx^^ ^ dxdz dz‘^ 



(3.71) 



With Hp = kT/mg being the scale height in the barometric height formula 
p = Po exp{- f dz/Hp} and substitution of dnjdx according to (3.68), (3.70) 
can be written as 



+ (3.72) 

Prom this equation the details of the magnetic field as well as the density 
inside the filament can be determined and compared with observations. These 
results confirm the sharp bend in the magnetic field: inside the filament the 
lines of force are bent according to tanh. Thus there is still a steady field and 
not a discontinuity. In addition, the model predicts a decrease in density with 
increasing height, as is evident from the observations. This decrease leads to 
a flattening of the ditch in the field lines with increasing height, which also 
is indicated in Fig. 3.5. 

Formally, the fine structure inside the filament, which is also a crucial 
factor for its stability, can be derived by simplifying (3.72). Since we are 




Fig. 3.5. Model of a solar filament 
{thick vertical line) suspended from 
magnetic arcades 
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interested only in the details inside the filament, we are concerned with a 
height range small compared with the total extension of the structure. In 
this case we can assume djdz — 0. Prom Gauss’s law, V • B = 0 for the 
magnetic field, we obtain — const and Bz — Bz{x). In this case, (3.72) 
gives 



B, dB, 
dx‘^ Hp dx 



(3.73) 



or, with a = l/{HpBx) = const. 



d^Bz 



+ otBz 



dBz 

dx 



= 0 . 



(3.74) 



Integration gives 

^ = const . (3.75) 

A solution of this diflFerential equation is 



BOO 

Bz = tanh^ , where ^ = 7777%” ^ 

ZHpJDx 



and B^ = const is the value of Bz for z — )> oo. Inside the filament, the field 
lines therefore do not exhibit a sharp kink but the smooth evolution of a tanh 
function. □ 



3.4 Magnetohydrokinematics 

Magnetohydrokinematics deals with the reaction of the electromagnetic field 
to a prescribed velocity field such that the electromagnetic field does not 
infiuence the velocity field. Thus, we do not have to solve the equation of 
motion. Such a situation corresponds to a large plasma-/? or a small value of 
S in (3.49). The basic equations to derive concepts such as frozen-in fields 
and the dissipation of fields are Maxwell’s equations and Ohm’s law. 



3.4.1 Prozen-in Magnetic Fields 

What happens to an electromagnetic field embedded in a moving medium 
with high conductivity? Let us assume a magnetic field B(r,to) at a time to 
and a prescribed velocity field u{r^t). The magnetic fiux through a surface 
S enclosed by a curve C then is ^ = f B dS. Let us follow the motion of C 
(see Fig. 3.6): as C moves, the magnetic fiux through S changes because (a) 
the magnetic field varies in time and (b) the field lines move into or out of 
S. As C moves, it creates a cylinder with a mantle surface M. All changes 
in flux through S due to the field lines entering or leaving C is associated 
with a flux of the very same magnetic field lines through M. Thus the total 
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Fig. 3.6. Moving fluid line C to derive the concept of 
the frozen-in magnetic fleld 



change in magnetic flux through S can be described by 



d^ 



#2 - <?i = di y ^ dsi + y basm . 

Si M 



(3.77) 



A surface element Sm on M is given as dSM = u x dldt with dl being the 
path along C and udt being the path along the direction of motion of C. 
Equation (3.77) therefore yields 

5i Cl 



With Stokes’ theorem the last term can be written as 

J B ' u X dll = — J u X B ' dll = — J V X {u X B) dS_ 



(3.79) 



Inserting into (3.78) gives 



dt 



J 

Si 



dB „ , 

— - V X (w X B) 



dSi 



(3.80) 



The uxB term can be expressed by Ohm’s law (3.45) while the dB/dt term 
can be expressed by Faraday’s law (3.43), and we obtain 

d^ f 1 f 1 

-=-yvxi-<is.=-y-i.di.. (3,81) 

S-i Cl 

The change in magnetic flux through a moving surface therefore is pro- 
portional to 1/cr. If (7 converges towards infinity, d^/dt converges towards 
zero: in a medium with infinite conductivity a, the magnetic field is frozen- 
into the plasma and carried away by the matter as if glued to it (left side in 
Fig. 3.7). A prime example of the application of this concept is the interplan- 
etary magnetic field frozen-into the solar wind (Sect. 6.3). A reversal of the 
concept, the frozen-out field, exists too. In the right panel of Fig. 3.7 a field- 
free plasma bubble moves towards a region filled with a magnetic field and 
pushes the field away until its kinetic energy is transferred to additional field 
energy as evidenced by an increase in magnetic pressure as well as magnetic 
tension. The field cannot enter into the bubble because then the magnetic 
fiux inside the bubble would change. An example is the solar wind frozen-out 
of the Earth’s magnetic field (Sect. 8.2). 
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Fig. 3.7. Prozen-in (left) 
and frozen-out (right) 
magnetic fields 



3.4.2 Deformation and Dissipation of Fields 

Frozen-in magnetic fields always are connected with an infinite conductivity. 
But what happens to a magnetic field embedded in a flow with finite conduc- 
tivity? For simplicity let us assume a to be spatially and temporally constant. 
The combination of Faraday’s law (3.43) and Ohm’s law (3.45) yields 

O O 1 

— V X (it X B) = — V X j . (3.82) 

at (T 

The current density can be expressed by Ampere’s law (3.44), leading to 

OB r-, , 1_ VxB /o OOX 

- 7 T V X (ux B) = — V X = V X (V X ^) . (3.83) 

at a fiQ iiqct 

The double cross product can be simplified with (A. 26): 

f)B 1 

— - V X (ti X B) = V^B. (3.84) 

at fiQa 

This equation allows us to determine how a given velocity field u deforms a 
magnetic field B. 

Deformation of the Field in a Plasma Flow. If we assume both magnetic 
held and plasma flow to be independent of time, a stationary solution exists 
with 

-V X (ux B) = . (3.85) 

fioa 

With a characteristic time scale r and a characteristic length scale L, the 
flow speed u± perpendicular to the field can be estimated: 

L 1 
r iioaL 



(3.86) 
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The physical interpretation is simple: if a plasma flows perpendicular to the 
magnetic held, it deforms the lines of force until their characteristic scale 
length is small enough to fulflll (3.86). Then the plasma starts to flow across 
the lines of force. 

Excursion 3. Dimensionless Variables and Dimensional Stability. To deter- 
mine the deformation of a line of force quantitatively, we shall use the tech- 
nique of dimensionless variables. This technique is quite common in fluid 
dynamics (see e.g. [149]). It is helpful to determine not only one solution of 
the differential equation but an entire manifold of solutions which can be 
scaled to the situation under study. This is particularly helpful in hydrody- 
namics when the solution for a certain size of syringe or nozzle is known and 
we are looking for a dynamically similar flow on a different scale. 

The idea is quite simple: all equations representing scientific laws can be 
expressed such that both sides are dimensionless. In its simplest case, just 
divide one side of the equation by the other: the result, one, is dimensionless. 
To take advantage of dimensionless variables, first identify the physical vari- 
ables relevant to the problem and combine them into dimensionless groups 
These groups have to be independent of one another. If the groups 
are dimensionless, combinations of groups such as AB or A/B‘^ are dimen- 
sionless, too. But they are not independent of either A or B, though any one 
of them might be included instead of A or B if this seems advantageous. If 
the groups are chosen in such a way that the quantity of interest occurs in 
only one of them, it can be expressed by the function A = f{B,C ^ ...). The 
nature of this unknown function can be determined analytically (as demon- 
strated below) or by computational methods. In an analytical solution, the 
advantage of the use of dimensionless variables is small; it only shows which 
parameters are important in scaling. If the solution has to be obtained by 
numerical simulations, the advantage of this method is more obvious: the 
procedure to determine a solution for one particular set of parameters can 
be quite time consuming. Each other set of parameters would require a new 
run. If dimensionless variables are used instead, the nature of the solution 
becomes obvious and it can be scaled to suit different sets of parameters. □ 

Let us now follow this principle and introduce new variables as suggested 
in [285]: 

L 

B = bB , r = rf , t — rt, u = Uu , U = — . (3.87) 

r 

The quantities with a tilde are dimensionless. With the abbreviation 

1 

V= — , 

fioa 

which can be interpreted as a magnetic viscosity, (3.84) yields 



(3.88) 
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Here a tilde above the differential operator indicates that the operator refers 
to a dimensionless variable. 

In ordinary hydrodynamics, the Reynolds number is a measure of the 
ratio between inertial and viscous forces. If the Reynolds number exceeds 
a critical value, the flow becomes turbulent. In its definition, the Reynolds 
number contains typical scales which have to be adjusted to the problem 
under study. Here we shall use a magnetic Reynolds number: 

Rm = = iiqcfUL . (3.90) 

V 

It differs from the ordinary Reynolds number in so far as the viscous forces 
described by rj do not depend on forces between particles but on the conduc- 
tivity of the medium (see (3.88)). The magnetic Reynolds number can also 
be interpreted as the ratio of the time scale of ohmic diffusion 



"TdifT 



47tL^ 

c^rj 



(3.91) 



to the advective time scale 



L 

"Tadv — 

V 



(3.92) 



equal to 



Rm = 



'Tdiff 

^adv 



AttvL 47 t V X {v X B) 

c^T] c^rj V X (V X B) 



(3.93) 



The latter is the ratio of the induction term to the dissipation term of the 
induction equation (3.84). 

Now we can rewrite (3.89) as 



-^-Vx{uxB) = — . (3.94) 

This dimensionless form has an advantage: it shows directly that a three-fold 
set of solutions exits. What does this mean? Assume we know a solution 
B{f,i) in dimensionless variables for a fixed Reynolds number Rm and a 
velocity field u. In this case, 6 B(Lf, rt) also is a solution of the same Reynolds 
number and the velocity field Uu as long as the conditions UL/rj = Rm and 
L/t = U are fulfilled. For instance, a free choice of U and L for a given Rm 
determines the values r and rj of the solution. But we still have a free choice 
for b. Thus one solution of (3.94) contains a three-fold infinite manifold of 
solutions, characterized, for instance, by C/, 77 , and b. 

Let us now determine the solution of (3.94) for a stationary parallel fiow 
perpendicular to a homogeneous magnetic field. Since the flow is stationary, 
(3.94) reduces to 

-V X (« X B) = ^V^B . 

Ru 



(3.95) 
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Let us orientate the x-axis of a Cartesian coordinate system along the flow: 
^ 0, 0). The magnetic held B = {Bx[z)^^^Bz{z)) has one com- 

ponent parallel and another perpendicular to the flow. Note that the flow 
varies along the perpendicular component. From V x JB = 0 we And that Bz 
is constant. Equation (3.95) is a second-order linear inhomogeneous partial 
differential equation for Bx as a function of z: 



d^Bx 

dz^ 



-BzRm 



dux 

dz 



Integrating twice we get 



Bx — Bz Rm 



^ J' Ux (^z C z D 



(3.96) 



(3.97) 



with C and D to be determined to fulfill the boundary conditions. Let us 
now assume the flow to have a cosine profile around z = 0, that is Ux == cos z 
for 1^1 < 7 t/ 2 and fix — 0 for |z| > tt/ 2. Equation (3.97) then reads 



Bx = — B; 2 ji?M(sinz Cz + D) . (3.98) 



The tangential component of B should be steady at the boundary of the 
flow to avoid currents; thus one boundary condition is Bx{z = tt/2) = 0. In 
addition, the flow is assumed to be symmetric around z = 0; thus the second 
boundary condition is Bx{0) = 0. The integration constants therefore are 
D = 0 and C = — tt/2 and (3.98) can be written as 

Bx = —BzRm fsin^ - . (3.99) 



We can now define a magnetic stream function 'll; 



d'lp 

dx 



= -Bz 



and 



dll; 

dz 



Br 



(3.100) 



Then V-0 x V.B is zero and lines with ip = const are the held lines. Integration 
of the second part of (3.100) combined with (3.99) gives the line of force as 



'0 = BzRm cos z + 



7T 



B.x 



(3.101) 



Let us now determine the maximum displacement of a line of force (see 
Fig. 3.8). Because ip is constant along a field line, it is '0(Ax, 0) = -0(0, tt/2). 
Therefore the maximal displacement is given as 

Ax = (l - ^) Rm ■ 



(3.102) 
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Fig. 3.8. Deformation of a magnetic field line by 
a plasma fiow. The maximal displacement Ax is 
determined by the Reynolds number Ru 



Thus the deformation of the magnetic field line increases with increasing 
Reynolds number. This is not surprising because the Reynolds number de- 
pends linearly on the conductivity (see (3.90)): if the conductivity and there- 
fore the Reynolds number is infinite, the magnetic field is frozen into the fluid 
and deformation of the field lines becomes maximal. 

Note that, in contrast to the frozen-in case, for finite conductivity matter 
starts to flow across the field after it is curved according to (3.101). The 
flow across the field is largest for small dimensions because on small scales 
the condition for frozen-in fields can be violated more easily. Thus we can 
confirm the suggestion made in connection with (3.86): at each point the flow 
curves the field such that the radius of curvature becomes small enough to 
allow for a flow of matter across the field. The physical reason is a reduction 
of the dissipation time with decreasing spatial scales as will be described 
below. 



Dissipation of Fields. Let us now have a look at a vanishing external 
velocity field. Then the second term on the left-hand side of (3.84) vanishes 
and we get 



ot i^ocr 



(3.103) 



Formally, this equation is equivalent to the heat conduction equation 



dT 

dt 



= XV'T, 



(3.104) 



where x is the thermal conductivity, and to the vorticity equation 



du: 



— , 



(3.105) 



where = V x tx is vorticity that describes the rotational state of the fluid. 
Note that the coefficients k, and u all have the same dimensions 

(mVs). 

While Parks [397] gives a formal description of the consequences of 
(3.103), we shall use a more graphical approach. Let us align the x-axis 
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of our coordinate system parallel to the magnetic field direction: B = 
(. 83 ^( 2 /, t), 0, 0). Equation (3.103) then can be simplified to 



dB^ ^ 1 
dt fxoa dy'^ 



(3.106) 



Formally, this is equivalent to the one-dimensional heat conduction equation 



dT d'^T 



(3.107) 



where T{y) is the one-dimensional distribution of temperature and x is the 
thermal conductivity. Equation (3.107) gives the temporal change in tem- 
perature as the heat is transported away by conduction. Therefore, (3.106) 
gives the temporal change in magnetic field strength as the magnetic field is 
transported by a process which depends on conductivity: the field dissolves. 
Assume that B is particularly strong at a certain position, say y = 0. This is 
analogous to a very hot spot on a metal rod; here we would expect the hot 
spot to cool down while the other parts of the rod warm up as heat is trans- 
ported towards them. The same thing happens with the magnetic field: it 
dissolves to larger values of \y\. Note that, while the magnetic fiux inside the 
yz-plane stays constant during this process, the magnetic energy decreases 
because the field-generating currents are associated with ohmic losses. 

If r is a characteristic time scale for magnetic field changes (e.g. the 
dissipation time during which the field strength decreases to 1/e) and L is 
the characteristic scale length, the change in B can be estimated from (3.106): 



B _ 1 B 

T IjLoCF L‘^ 



(3.108) 



Thus the dissipation time is 

T K yboaL^ = /Dm , 



(3.109) 



where Dm = I/mo^^ == V be interpreted as a magnetic diffusion coefficient, 
r depends on the square of the characteristic scale length of the field: smaller 
fields dissipate faster than larger ones. That is the reason why with reduced 
spatial scales plasma starts to flow across the field. In addition, the dissipa- 
tion time increases with increasing conductivity: for infinite conductivity, the 
dissipation time becomes infinite too, leading to the frozen-in field. 

The Sun, for instance, has a linear dimension of about 7 x 10^ m and 
an average conductivity of 2.6 A/Vm. This gives a dissipation time of about 
1.2 X 10^^ years, nearly three times the age of the Sun. Thus if during its 
creation the Sun had received a magnetic field, this still would be present 
today as first suggested in [302]. On the other hand, the solar magnetic field 
is highly variable on time scales of months to years (Sect. 6 . 6 ), making the 
presence of a fossil field very unlikely. Instead, a MHD dynamo (Sect. 3.6) 
seems to be responsible for the solar magnetic field. 
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Fig. 3.9. The dissipation time of the 
magnetic field decreases as the spatial 
scales decrease 



According to (3.109) the dissipation time depends on the scale of the 
field. Thus, if the field on the left-hand side of Fig. 3.9 is divided into smaller 
patches of length L/n instead of L, it dissipates times faster than the 
original field. Such a redistribution of field lines smearing out the boundaries 
between regions of opposite polarity and leading to structures on smaller 
scales can result from turbulent plasma motions. For instance, the stochastic 
motions in the photospheric and chromospheric network on the Sun might 
contribute to the dissipation of magnetic fields, in particular in the declining 
phase of the solar cycle. 

Example 14- A sunspot with a radius of about 20 000 km has, from (3.109), 
with the conductivity given above, a lifetime of about 1000 years. If we look 
more closely at the sunspot, in particular the granules around the spot, we 
find a spatial scale of about 1000 km, that is, 1/20 of the scale of the sunspot. 
Since the dissipation time depends on the square of the length scale, in the 
granules it is only 1/400 of the value for the whole sunspot, that is 2.5 years 
- which comes closer to the observed lifetime of a sunspot. □ 

A vortex in the plasma flow might even create a field-free region inside 
an otherwise relatively undisturbed held [556]. 



3.5 Reconnection 

The dissipation of magnetic held lines is important, e.g. in reconnection, 
which is assumed to take place in many locations in the solar system, such £is 
solar flares, the tails of magnetospheres, and in the exchange of solar wind and 
magnetospheric plasma at the day-side magnetopause (flux-transfer events). 
Reconnection not only plays an important role in the rearrangement of mag- 
netic flelds but also in the formation of shock waves and the acceleration of 
energetic particles. 

The concept of reconnection goes back to Petschek [405]. It is widely used 
in magnetospheric and solar physics, although the physics behind the process 
still is under debate; sometimes it is even questioned whether reconnection re- 
ally exists. The basics of reconnection are outlined in Fig. 3.10. Reconnection 
requires a topology where two magnetic flux tubes of opposite polarity meet 
(a). According to Ampere’s law, in the neutral line between these flux tubes 
a current flows perpendicular to the drawing plane with a current density 

1 AB 
^ d 



(3.110) 
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Fig. 3.10. Reconnection: merging of magnetic field lines leads to a rearrangement 
of fields. In addition, magnetic field energy is released, heating the plasma, creating 
a shock wave, and accelerating particles 



The flux tubes are frozen into a plasma with infinite conductivity. As a plasma 
flow u pushes the flux tubes towards the neutral line, an X-point configuration 
arises where anti-parallel held lines meet (b). As the distance between the 
flux tubes decreases, the current density (3.110) increases and may surpass 
the current density the plasma can carry. Then the current becomes unstable, 
leading to a finite conductivity. Now the frozen-in approximation breaks down 
and magnetic held diffusion starts. At the X-point, magnetic held lines merge. 
Magnetic tension leads to a shortening of the merged held lines, pulling them 
away from the former X-point (c). The energy of the terminated neutral line 
current is converted to high-speed tangential flows, indicated by ua- The 
speed of this plasma flow might exceed the local Alfven speed, forming two 
shock waves propagating away from the reconnection site. The shocks, in 
turn, might lead to particle acceleration (Sect. 7.5). 

The properties of the current sheet are determined by (3.84). For an in- 
finitesimally thin current sheet and a uniform resistivity, a self-similar solu- 
tion for the magnetic held component Bj_ perpendicular to the current sheet 
and parallel to the flow can be determined [14, 110, 113]: 



= Boerm , with ^ , (3.111) 



where Zj_is the spatial coordinate along B± and erf the error function 




(3.112) 



Since the parameter ^ depends also on the time t, there is a temporal variation 
in the width d of the current sheet. The latter can be determined by setting 



d=,f^. (3.113) 

V 

The current density associated with this magnetic held profile is a Gaussian 
centered around the middle of the current sheet and spreading with time t. 
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According to [113], the magnetic field energy available for a slice of the 
current sheet is Wb = f '^b dZx = f B^/{ 2 j 2 o) dZj_; the rate of energy conver- 
sion can be determined from (3.84) and Ampere’s law (3.44) as 

^ = -jE-jdh. (3.114) 

If the onset of reconnection does not modify the general field and plasma 
configuration, stationary reconnection results, as an equilibrium between in- 
fiowing mass and magnetic fiux, magnetic diffusion, and out-flowing mass 
and magnetic flux. This is also called steady-state reconnection. In Sweet- 
Parker reconnection [392,513], a diffusion region of width d and length L is 
assumed with L ^ similar to the configuration in Fig. 3.10. The rate of 
reconnection and the properties of the outflow can be determined from the 
conservation of mass, momentum, energy, and magnetic flux. 

Solving Ohm’s law (3.45) for the electric field and expressing the current 
density j by Ampere’s law (3.44), we obtain the following for the electric field 
sheet: 



E= — — uxB = ^ ^ ^ — u X B = uq X Bq = const . (3.115) 

a fioa 

Since we assume steady-state conditions, Faraday’s law (3.43) gives E = 
const == Eq. Outside the current sheet the conductivity is high and the mag- 
netic field is frozen into the plasma flow. Here all electric fields vanish im- 
mediately, except for the electric induction field u x B. Therefore we have 
Eq = u X B — uq X Bq outside the current sheet. Inside the current sheet 
the situation is different: here the conductivity is finite, the frozen-in condi- 
tion breaks down, and the plasma speed vanishes. Thus the induction field 
vanishes too, and according to (3.115) the electric field is 



V xB 



(3.116) 



The diffusion region is characterized by Ru < 1- Taking the width (3.113) of 
the diffusion layer as the characteristic length scale and assuming Ru 1, 
we obtain the width of the layer as 

. (3.117) 

Ho<^uo 



For a current sheet of infinite length L, as indicated in the left-hand panel 
in Fig. 3.11, the converging plasma streams would lead to a pileup of plasma 
density and magnetic field inside the current sheet. This is incompatible with 
the assumption of a steady state. Instead, an outflow of mass and magnetic 
field out of the diffusion region is required, as sketched earlier in the right- 
hand panel in Fig. 3.10. This outflow is possible only for a finite extent of 
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length (left), and a current sheet of finite length L larger than the width d (right) 



the diffusion region, as already mentioned above. The right-hand panel of 
Fig. 3.11 allows a closer look into the diffusion region. To reach a steady 
state, the outflow of plasma and magnetic field must equal its inflow, i.e. 



u X B = ua X Ba = Uq X Bq , 



(3.118) 



or in scalar form (because all velocities are perpendicular to the fields). 



uB = uaBa = , 

and from the equation of continuity. 



ua(1 = uqL . 



With (3.117), we therefore obtain 

UAd 2 ^ 

Uo- ^ - Wa > 



(3.119) 



(3.120) 



(3.121) 



where = afiQUAL is the magnetic Reynold’s number in the outflow 

region. The magnetic field in the outflow region can be determined from 
(3.119) as 

Ba = Bo — =^Bj. (3.122) 

UA L 

Since we start from d L, we also obtain Ba Bq and uq u a- The 
outflow speed can be determined from the energy balance: the inflow of kinetic 
and magnetic energy must be balanced by its outflow or, formally, 

(3.m) 



With (3.120), this gives 
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9 Bq 9 Bjs^ 

gul + — = Qu\ + — 
1^0 Mo 



Solving for ua gives 




(3.125) 



where va = Bo/y/fiog is the Alfven speed (4.38) of the incoming flow. The 
equation has two solutions, ua = uq (for d = L) and ua = va> The outflow 
speed of the plasma is equal to the Alfven speed ^’A,in in the incoming plasma 
flow, and the rate of reconnection i?sp equals the Mach number of the incident 
flow: 




Thus the reconnection process depends on the conductivity. For space plas- 
mas, where the conductivity is high, a low rate of reconnection results. The 
Sweet-Parker reconnection therefore is a slow process in which about half 
of the incoming magnetic energy is converted into kinetic energy of the out- 
flowing plasma. This acceleration leads to the two high-speed plasma flows 
indicated in panel (c) in Fig. 3.10. 

Petschek reconnection occurs in more localized regions; the process is 
faster because the length scale L is smaller; or more correctly: the length 
scale of the diffusion region equals the length scale of the system. In Petschek 
reconnection about three-flfth of the inflowing magnetic energy are converted 
into kinetic energy behind the shock waves, the remaining two-fifths is used 
to heat the plasma. The reconnection rate Rp is given as 




(3.127) 



Petschek reconnection varies less with conductivity and therefore is much 
more efficient in mixing plasmas and fields. And, with a more efficient recon- 
nection, the resulting acceleration becomes more violent, too. 

Sweet-Parker reconnection appears to play an important role at the mag- 
netopause where the high-speed flows streaming away from the reconnection 
side can be detected in situ. Petschek reconnection probably does not play a 
role in magnetospheric plasmas but might be important in solar flares. Note 
that the geometries sketched in Figs. 3.10 and 3.11 probably best are realized 
in the current sheet of the magnetotail. Geometries in flares and on the day 
side of the magnetosphere are less symmetric. 

Magnetic reconnection is not only a theoretical concept applied to various 
space plasmas; see e.g. [463]. There exist also some laboratory experiments, 
as summarized in [61]. 
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3.6 The Magnetohydrodynamic Dynamo 

Magnetic fields can be found almost everywhere in space. The magnetosphere 
could not exist without the magnetic field of the Earth, interplanetary space 
is structured by the solar magnetic field frozen-into the solar wind, and the 
Sun itself would be a boring star were it not for the magnetic field. But these 
fields are not permanent: the Sun reverses polarity in an 11-year cycle and 
polarity reversals of the Earth’s magnetic field are known too. Thus these 
fields cannot be remnants of a fossil field left from the time of the big bang. 
Instead, a mechanism is required that generates these fields and also allows 
for the (quasi-cyclic) variations. In such a magnetohydrodynamic (MHD) 
dynamo a residual seed field is amplified. The energy required for this process 
is drawn from the rotational energy of the star or the planet. Thus the motion 
of the plasma drives a dynamo, which amplifies a seed field and preserves it 
against losses. If we use the solar radius as the scale length and a conductivity 
of 2.6 A/V m in (3.86), a velocity of the order of 10“^ m/s results: thus very 
small fiow speeds are sufficient to compensate for the dissipation of magnetic 
energy. Our current understanding of MHD dynamos is summarized in [425]. 



3.6.1 The Idea 

In principle, a dynamo consists of a permanent magnet and a rotating circuit 
loop in which the current is induced. In the hot interior of the Sun and the 
planets, permanent magnets cannot exist. Thus the static magnetic field must 
be created by a current, too. Part of the current induced into the circuit loop 
than is fed back into the system to support the static field. Without such a 
feedback, the MHD dynamo would not work. 

In the core of the Sun or the planets such well-defined parts as coils 
or rotating wires do not exist. Instead, we find a homogeneous and highly 
conductive fluid, rotating with the star or planet. Thus the dynamo also is 
called a homogeneous dynamo. Since the matter inside the core is liquid, the 
question of how to create a magnetic field can be reduced to a simpler form: 
What is the nature of the plasma fiow that allows to support the required 
currents? 

Since we want to apply the dynamo to planets and stars, the model has 
to explain the most important features of their magnetic fields, such as: (a) 
the magnetic flux density increases with increasing rotation speed, (b) to 
first-order, the field is dipole like, (c) the dipole axis and the axis of ro- 
tation are nearly parallel, (d) the dynamo should allow for fluctuations in 
the magnetic field direction and flux density, and (e) polarity reversals with 
quasi-periodic but nonetheless stochastic character should be allowed. This 
latter point means that the reversal period can be identified (for instance 
11 years for the Sun and about 500 000 years for the Earth), but that the 
individual cycle lengths are distributed stochastically around this average. 
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Since the fields are axial-symmetric, a configuration as the uni-polar in- 
ductor in tempting. There a metal cylinder rotates parallel to a homoge- 
neous magnetic field, leading to a potential difference between the center 
and the mantle of the cylinder. But in the uni-polar inductor the field can- 
not be amplified. For astrophysical plasmas this is expressed by Cowling’s 
theorem [111], dating back to 1934: there is no finite velocity field that can 
maintain a stationary axial-symmetric magnetic field. The proof of this the- 
orem is based on the induction equation (3.84) which, under the conditions 
cited in Cowling’s theorem, would allow for decaying magnetic fields only. 



3.6.2 The Statistical Dynamo 

The situation is different in a statistical magnetic field: on the Sun, for in- 
stance, the turbulent motion in the convection zone modifies the field. The 
average field Bq = (B) still is axial-symmetric but it is modified by fiuc- 
tuations JBi with (Bi) = 0. Thus the magnetic field is B = Bq Bi and 
the velocity field is u = Uq The cross product of the speed and the 

magnetic field reads 



{u X B) = Uq X Bq -f- (ui X Bi) . (3.128) 

The products {ui x Bq) and {uq x Bi) vanish because the quantities with 
index ‘o’ are constant and the average of the other quantity equals zero. 
The product {ui x Bi), which is the correlation function, does not vanish 
because the fiuctuations are not independent: because the matter has a high 
conductivity, the magnetic field is frozen-into the plasma, and a change in 
the velocity field leads to a corresponding change in the magnetic field. To 
first order, the correlation function can be approximated as 



{ui X Bi) = aBo - /3V X Bo . (3.129) 

Excursion 4. As suggested by Parker [390], (3.129) can be derived as follows. 
The magnetic field equations (3.41)-(3.44) are linear in B, B, Qc, and j. The 
quantities can be split into average and fluctuating quantities and we have 
two formally identical sets of equations, one for the average field and one for 
the fluctuating field (see Sect. 4.1.4). Ohm’s law (3.45) has to be handled 
differently because it contains a product of fluctuating quantities u x B. 
Splitting Ohm’s law into an average current jo and a fluctuating current j\ 
yields 



3 = Jo+ ji 

= (J (Bq 4" Bi -|- Uo X Bq Uq X B\ T U\ X Bq -f- V>i X Bi) (3.130) 



^ A brief introduction to the mathematical basics of instantaneous quantities, av- 
erages, and fluctuations is given in Sect. 4.1.3. 
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Taking the average gives 



jo — (^{Eq uq X Bq + txi X ^i) . (3.131) 

Thus Ohm’s law for the average quantities contains an additional term, the 
correlation function between the fluctuating velocity fleld and the fluctuating 
magnetic field. The expression (3.129) for this term is derived under the 
assumption that the average velocity Uq vanishes and that the fluctuating 
velocity field is homogeneous and isotropic: neither are there points in space 
with extremely high or low levels of fluctuations nor are the fluctuations 
preferentially in one direction. 

The induction equation (3.84) for the instantaneous quantities can be 
written 



1 BB 

V^Bo + V X («i X Bo) - ^ 

/locr at 

= V^Bi - V X («i X Bi) + . (3.132) 

Hoa at 

This equation still holds if Bq and B\ are multiplied by the same factor: the 
fluctuating part B\ thus depends linearly and homogeneously on the average 
field Bq. This is also true for {u\ x B\)^ since averaging does not change the 
dependence: 

{ui X Bi) ~ ajBo . (3.133) 

Let us now assume that, to first order, B\ and thus also {u\ x ^i) at a 
certain position P depend only on Bq and U\ in a small neighborhood. Then 
{ui X Bi)\p depends only on Bq\p and {dBQ/dxi)\p. Thus (i^i x B\) must 
be proportional to V x Bq\ 

{ui X Bi) X Bq . (3.134) 

Thus, in sum, we obtain (3.129). □ 

Both a and j3 are determined by the properties of the turbulent veloc- 
ity field. The /?-term describes the increase in magnetic diffusion due to the 
turbulent motion, leading to a faster dissipation of the field. For a mirror- 
symmetric velocity field, a would vanish, but not in a rotating system, where 
the velocity field is not symmetric. Taking the average of the induction equa- 
tion (3.132) and considering the magnetic viscosity (3.88), we get 

BB 

- V X (wo X Bo + aBo) = -{v + 13)^ x (V x Bo) • (3.135) 

While (3 modifies the viscosity, the a-term contains the basic difference com- 
pared with (3.84): it allows for an electro-motoric force parallel to the average 
magnetic field; Cowling’s theorem does not apply to this equation. 
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3.6.3 The al7 Dynamo 

The basic idea of the MHD dynamo can be applied to different geometries 
and to stationary as well as periodically varying magnetic fields. Because 
we are interested in axially symmetric fields, it is reasonable to describe the 
magnetic field as consisting of a toroidal and a poloidal part: 

{B) = (JBtor) + (Bpol) = B + V X Ae^ , (3.136) 

where is the unit vector in the toroidal direction. Thus two scalar quan- 
tities, A and B, determine the three field components. With this ansatz, the 
induction equation gives two equations: one describing the ohmic dissipation 
of B and the generation of B out of A due to the a-effect and the differential 
rotation Vi7, the other describing the ohmic dissipation of A combined with 
the generation of A out of B. 

Differential rotation can occur for various reasons. The Sun, for instance, 
has a higher angular speed at the equator than at higher latitudes, and thus 
the rotation depends on latitude. The differential rotation inside the Earth 
is due to the differences in angular speed between the faster inner and the 
slower outer core. In both cases, because the field is frozen into the plasma, 
a deformation of the field line arises from the differential rotation. 

The a-effect, on the other hand, is associated with the turbulent motion 
of the plasma, in particular the upward and downward motions associated 
with convection. Although this motion is stochastic, its combination with the 
Coriolis force leads to a turbulent motion which introduces a systematic twist 
into an originally toroidal field, as shown in Fig. 3.12. The resulting magnetic 
field coil allows a current parallel to the undisturbed toroidal field. 

Inserting (3.129) into (3.131), we obtain 

jo = CF {Eq [uq X ^o) + oiBo — /3(V X Bq)} . (3.137) 

The third term on the right-hand side gives, depending on the sign of a, 
a current parallel or antiparallel to the average magnetic field Bq. With 
Faraday’s law (3.44), we can rewrite the last term on the right, and obtain 



Turbulent mass motion 




Fig. 3.12. A combination of the stochastic 
motion and the Coriolis force leads to tur- 
bulent motion (short twisted arrows) of the 
plasma which twists an originally toroidal 
field (lower line) into a coiled field which 
allows for a current parallel to the original 
field 
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Fig. 3.13. Magnetohydrodynamic 
dynamo: differential rotation de- 

forms a poloidal magnetic field into a 
toroidal one. The a-effect allows cur- 
rents parallel to the field, giving rise 
to a toroidal magnetic field in the op- 
posite direction 

j = (Jt {Eq uq X Bq + cxBq) , (3.138) 

where 

- = -+P (3.139) 

(7 

is the turbulent conductivity. Since (3 is positive [299], ctt always is smaller 
than cr: the turbulence described by the /3-term reduces the conductivity. In 
particular, for a — > oo turbulent motion would limit the conductivity to a 
finite value. Fields in a turbulent plasma therefore dissipate faster, and the 
dissipation time (3.109) becomes a turbulent dissipation time 

tt ~ . (3.140) 

Graphically, ax takes into account the fact that the turbulent motion reduces 
the length scales L of the system. 

The combination of the effects of a and i? allows us to describe the MHD 
dynamo as sketched in Fig. 3.13. We start with a poloidal field in the Sun 
at t = 0. The differential rotation deforms the magnetic field, leading to a 
toroidal field (t = T/4). The a-effect leads to electromagnetic forces parallel 
to the field, and thus a toroidal current flows (dashed lines). Although the 
magnetic held directions are opposite in the two hemispheres, the asymmetry 
of the Coriolis force leads to an asymmetric a-effect and therefore parallel 
currents in both hemispheres. This current leads to a magnetic field directed 
opposite to the original held {t = T/2). Half a cycle is now finished. This 
dynamo is called the ai? dynamo because both the a-effect and the differen- 
tial rotation contribute to the dynamo process. The dynamos inside the Sun 
and the Earth are based on this principle; their details will be discussed in 
Sects. 6.6.2 and 8.1. 

If the a-effect was not at work, the differential rotation would still trans- 
form the poloidal magnetic field into a toroidal one. However, no polarity 
reversal would occur and, in time, the entire held would dissipate. The dif- 
ferential rotation, on the other hand, is not essential to the MHD dynamo. 
The a-effect can also work with turbulent motions which, for some reason, 
have a preferred direction of motion; this is often an upwelling of magnetic 
flux combined with a particular direction of rotation of the flux tubes [299] . 

The MHD dynamo requires an initial magnetic held which is amplified by 
a suitable feedback mechanism. Thus at first glance the MHD dynamo vio- 
lates Lenz’s rule which states that all fields, currents and forces are directed 
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so as to hinder the process that leads to their induction. For instance, an 
increase in the magnetic field leads to currents which create a magnetic field 
opposite to the original one. Lenz’s rule thus stabilizes the system; it does 
not allow for the positive feedback required in the MHD dynamo. Were we to 
build such a dynamo on the basis of one process only, Lenz’s rule would be 
violated. But the MHD dynamo has the remarkable feature that although all 
individual processes obey Lenz’s rule, their sum allows for positive feedback. 



3.7 Debye Shielding 

So far, we have described a plasma in the context of one-fiuid magnetohydro- 
dynamics: the plasma consists of one particle species only and moves with 
the bulk speed. The thermal motion of the particles is neglected and thus 
there is no motion of particles relative to each other. 

We will now, though in a simple formalism, make use of the stochastic, 
thermal motion of particles in a two-component plasma consisting of electrons 
and protons. A local deviation from quasi-neutrality arises from the random 
thermal motion. Quasi-neutrality depends on the size of the volume under 
consideration. If the volume is very small, housing only one particle, quasi- 
neutrality cannot be obtained. But even if we increase the size of the volume, 
the thermal motion might lead to an excess of particles with one charge sign. 
Then the shielding of a certain particle with one polarity due to particles of 
the opposite polarity becomes important. The typical spatial scale for such 
shielding is the Debye length, already mentioned in the introduction. 

The region depleted of electrons due to their random thermal motion 
is limited in extent because the displaced electrons create an electric field 
which acts as a restoring force. Consider a sheath of width D depleted of 
all electrons. Because of their larger mass, the ions are less mobile and stay 
within this sheath. Within D therefore a positive charge exists while the 
electrons can be regarded as a surface charge collected at the boundary of 
the sheath. The electric field is different from zero within the sheath; outside 
the sheath the field of the positive ions is screened by the surface charge. 

The energy in the electric field stems from the kinetic energy of the ther- 
mal electron motion. With rie as the electron density in the undisturbed 
plasma, the kinetic energy of the electrons in a sheath of thickness D is 
nekBTD/2. If all electrons are removed from this sheath, a restoring force 
proportional to D acts on them. The energy contained in the electric field 
created by the charge separation depends on Thus there is a certain 
width Ad at which the energy contained in the field equals the kinetic energy 
of the electrons originally present within this region: the kinetic energy of the 
thermal motion of the electrons is converted entirely into field energy.^ 

^ Note that this is different from the discussion of the plasma oscillations in 
Sect. 4.3.1 because here the sheath depleted of electrons results from their ther- 
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The electric field created by the ions inside the sheath is V-.E = eZnijeo. 
Thus the Coulomb potential (/? can be written as = —eZn\leo. In the 
one-dimensional case, the potential is 



ay 

dx‘^ 



—eZnijeo = const . 



(3.141) 



The general solution of this equation is (p{x) = —Zen\x^j2e{) -h C\x -f C 2 , 
with the constants determined by the boundary conditions. The coordinate 
system is fixed such that the potential vanishes at x = 0 and C 2 is zero. 
If the field is symmetric around x = 0, then C\ is zero, too. In addition, 
the transition between the potential inside and outside the sheath has to be 
steady. Thus the potential can be written as 






_ f —ZeUiX^l2e{) 
\ -ZeriiD‘^ /Seo 



The electric field is 






for |x| < D/2 
for jxj > D/2 



for |x| < D/2 
for jxj > D/2 



(3.142) 



(3.143) 



The energy contained in the electric field can be determined as 



+D/2 

-D/2 



e^Z‘^nf 

2eo 



+D/2 

/ x^dx 

-D/2 



6eo 



(3.144) 



We are now looking for the thickness Ad of a sheath where the field energy 
equals the kinetic energy of the electrons originally present inside the sheath: 



ne^BTAp _ e‘^Z‘^nfXl 

2 fi^o 



(3.145) 



Since the above geometry is one-dimensional, only one degree of freedom is 
considered in the kinetic energy. Quasi-neutrality requires Uq — Zrii, and thus 
the Debye length Ad is given as 



Ad 



SeoksT 



k^T 1 



m u. 



(3.146) 



pe 



where is the angular frequency of electron plasma oscillations (Sect. 



4.3.1). 

Though derived for the one-dimensional case, this equation also holds in 
the three-dimensional case where the Debye length can be interpreted as the 



mal motion. In plasma oscillations, instead, a cold plasma is considered and 
the charge separation results from an external force, e.g. a beam of electrons 
travelling through the plasma. 
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maximal radius of a sphere which, due to thermal motion, might be depleted 
of electrons. In spatial regions small compared with the Debye length, quasi- 
neutrality is likely to be violated, while on larger scales the plasma is quasi- 
neutral. In the latter case the kinetic energy contained in the thermal motion 
is not large enough to disturb the particle distribution over the entire region. 
Figure 1.1 shows typical values of the Debye length and numbers of particles 
inside a sphere with the Debye radius for different electron densities and 
temperatures. Some typical plasmas are indicated. 

The Debye length also can be used to assess the influence of an instrument 
on measuring plasma parameters. Let us start from an initially cold plasma, 
i.e. the motion of the electrons can be ignored. Let us now insert a small 
positive charge q into this plasma. Immediately, it will be surrounded by a 
cloud of electrons while the ions are repulsed. The electron cloud screens the 
additional charge; thus, outside the electron cloud its electric field vanishes. 
If we now increase the temperature, the electrons gain thermal velocity. Deep 
inside the electron cloud this velocity will be too small to overcome the at- 
traction of the positive charge. At the outer edges of the cloud, on the other 
hand, the thermal energy might be large enough to exceed the electrostatic 
potential of the partly screened charge, allowing an escape from the cloud. 
The Debye length then can be interpreted as the spatial scale over which 
the potential of the point charge q has decreased by a characteristic value 
(see Fig. 3.14): within the Debye length, electrons are influenced by the test 
charge, while at larger distances the test charge goes unnoticed. 

A more general definition would read: only charged particles within a 
distance of Ad exert an electrostatic force on each other. This is different 
from bodies which interact gravitationally, like, for example, interacting stars: 
gravitation cannot be screened by repulsing forces, it has an indefinite range. 

Debye screening also is important in preventing local clusters of charges. 
For eif /ubT the influence of the electric field on the particle energy is small; 
thus the particle motion is determined by the thermal speed. The Debye 
shield arises from small differences in the particle motion: some particles with 
opposite charge stay close to the test charge slightly longer, while particles 
with equal charge move away a little bit faster. Charge inhomogeneities in a 
plasma therefore are balanced on the scale of the thermal propagation time. 




Fig. 3.14. The electric po- 
tential of a test charge is 
reduced by the surround- 
ing plasma 




Exercises and Problems 
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3.8 Summary 

Magnetohydrostatics is concerned with the energetics of the electromagnetic 
field without allowing for the collective motion of the plasma. Basic concepts 
derived from this approximation are the magnetic pressure and the mag- 
netic tension. Magnetic pressure, graphically interpreted as mutual repulsion 
of field lines, prevents a magnetic field from being compressed by external 
forces. If part of a magnetic line of force is displaced from its original posi- 
tion, the magnetic tension creates a restoring force, which graphically can be 
interpreted as the tendency of a line of force to shorten itself. 

Magnetohydrokinematics assumes the energy density of the plasma to be 
much larger than the energy density of the field, which allows us to ignore the 
infiuence of the field on particle motion. The basic concepts are as follows: 

(a) If the conductivity of the plasma is infinite, the magnetic field lines are 
frozen-into the plasma. Thus the plasma fiow carries away the magnetic field. 

(b) If the conductivity is finite, the magnetic field is deformed by the plasma 
fiow until diffusion allows the plasma to fiow across the field lines, (c) In 
a stationary plasma, the magnetic field dissipates, with the dissipation time 
depending on the square of the linear dimensions (small fields dissipate faster) 
and linearly on the conductivity (if the latter is infinite, the dissipation time is 
infinite, too, and the field is frozen-in). These concepts are important, e.g. in 
our understanding of the merging of magnetic field lines, called reconnection, 
and in dynamo theory. 



Exercises and Problems 

3.1. Explain the difference between convective and partial derivatives. Find 
examples to illustrate the differences. 

3.2. Recall simple hydrodynamics and give other examples of the momentum 
balance. Discuss the different forms and compare with the Navier-Stokes 
equation. 

3.3. Derive the hydrostatic equation from the Navier-Stokes equation. Which 
terms do you need? 

3.4. Give a quantitative discussion of the stability of a sunspot (all important 
numbers are given in Table 6.1). 

3.5. Is the filament sketched in Fig. 3.5 realistic? Why does it not dissolve 
towards the sides (remember, it is a plasma, not a solid body)? 

3.6. What is the meaning of viscosity and Reynolds number? What are 
the formal differences between hydrodynamics and magnetohydrodynamics? 
What are the differences in substance? 
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3 . 7 . Explain the consequences of stationary flows parallel and oblique to the 
magnetic fleld. 

3 . 8 . Why has pressure the units of an energy density? 

3 . 9 . Show that (3.76) is a solution of (3.75). 

3 . 10 . Show that in an ideal, non-relativistic magnetohydrodynamic plasma 
the ratio between the electric and the magnetic energies is (t’x/c)^. 

3 . 11 . Determine the dissipation times for a copper block (side length 10 cm, 
conductivity 260 A/Vm) and the interstellar medium (linear dimension 
10^^ m, conductivity 2.6 jiV/Am). Compare with the age of the universe 
(about 10^^ s). 

3 . 12 . Determine the Debye length and the number of particles inside a Debye 
sphere for electrons and protons moving with thermal speeds (see Sect. 5.1.2) 
in the following flelds: (a) the Earth’s magnetosphere with n = 10^ cm“^, 
T = 10^ K, B = 10“^ G; (b) the core of the Sun with n = 10^^ cm“^, 
T = 10^-^ K, B = 10® G; (c) the solar corona with n = 10® cm“®, T = 10® K, 
B = 1 G; and (d) the solar wind with n = 10 cm“®, T = 10® K, B = 10"® G. 
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Roll on, thou deep and dark blue Ocean - roll! 
Ten thousand fleets sweep over thee in vain; 
Man marks the earth with ruin - his control 
Stops with the shore. 
Lord Byron, Childe Harold’s Pilgrimage 



In this chapter we shall catch a glimpse on the vast zoo of plasma waves. 
The formalism to derive these waves, perturbation theory, briefly will be 
introduced. We will not derive all types of waves formally; instead, we shall 
limit ourselves to the magnetohydrodynamic waves, which are Alfven waves 
and ion acoustic waves. The derivation of the dispersion relations for other 
types of waves follows the same scheme, it only differs in the terms considered 
in the equation of motion, in the assumptions made in the equation of state, 
and in whether a one-fluid description of the plasma is sufficient or if a two- 
fluid description is required. Detailed derivations can be found e.g. in [36,97, 
191,192,298,397,504,512,534]. For the purpose of this book, however, it is 
more important to grasp the nature of the waves than to Addle around with 
the mathematical tricks involved in solving the equation of motion. 

This chapter is limited to elementary types of waves. First, the geometry 
always is simple: in a magnetized plasma the waves either propagate parallel 
or perpendicular to the undisturbed magnetic held - oblique waves are not 
considered here. Physically more important is the limitation to small distur- 
bances, i.e. small amplitude waves: the basic set of magnetohydrodynamic 
equations is a set of coupled non-linear partial differential equations. Thus 
in principle we can expect non-linear couplings between different fluctuating 
quantities of the wave. If we limit our discussion to small amplitude waves, 
the equations can be linearized: whenever two oscillating quantities are mul- 
tiplied, since both are small, we consider this a higher order term and ignore 
it. If we apply these results to a real situation, we always have to take one step 
back and justify whether the amplitudes calculated in our real situation are 
small enough so that the non-linear terms actually are negligible compared 
with the linear ones. 

In a plasma, a large variety of waves exists. A simple phenomenological 
classification in transversal and longitudinal waves is insufficient. Instead, 
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we have to consider the conditions under which certain types of waves can 
exist. For instance, in a cold plasma the thermal motion of the particles 
and therefore the pressure vanishes. Thus elastic waves cannot exist; they 
can form only in a warm plasma where a pressure gradient can build up. In 
an isotropic plasma, no magnetic field exists, which allows other modes of 
propagation than an anisotropic plasma. As a third criterion, we also have 
to consider which particle species is in motion. 

4.1 What is a Wave? 

For a start, let us briefiy repeat the basic descriptions and properties of 
waves. A wave is a disturbance propagating through a continuous medium. 
It gives rise to a periodic motion of the fiuid. Even a complex oscillation, at 
least as long as the amplitude of the disturbance is small, can be decomposed 
into different sinusoidal waves by the technique of Fourier analysis. Small- 
amplitude disturbances lead to plane waves with the direction of propagation 
and the amplitude being the same everywhere. 

4.1.1 Wave Parameters 

A sinusoidal wave is described by its frequency uj and its wave vector fe: 

B{r, t) = Bo exp {i(fe • r — uot)} . (4.1) 

The measurable quantity is the real part of this complex expression. The 
exponent in (4.1) is the phase of the disturbance. The temporal derivative 
of the phase gives the frequency u, and the spatial derivative gives the wave 
vector fc that specifies the direction of wave propagation. A surface of constant 
phase, also called a wave surface, is displaced by the phase velocity Uph, which 
can be determined from d(fe • r — ut)/dt as Vph. — ojjk or, in vector form, 

^Ph ~ 

If cj/fc is positive, the wave moves to the right; for negative a;/ A: it moves to the 
left. For electromagnetic waves the refraction index n is defined as the ratio 
between the speed of light and the phase speed of the wave: n = c/uph = ck/uj. 

The phase velocity can exceed the speed of light. This is not in contra- 
diction to the theory of relativity because an indefinitely long wave train of 
constant amplitude does not carry information. Information can be carried 
by a modulated wave, on which variations in frequency or amplitude are su- 
perimposed. We can regard each bit of information in this modulated signal 
as a wave packet which then moves with the group velocity 

du 

M ‘ 






(4.3) 
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Fig. 4.1. Dispersion relation 
with a resonance at the fre- 
quency (Jres- The slope of the 
curve gives the group velocity 
Ug, as indicated by the dashed 
line 



The group velocity Vg given by the slope of the dispersion relation (see dotted 
line in Fig. 4.1). It is always less than the speed of light and determines the 
velocity with which the energy of the wave is transported. 

The most important tool in the description of waves is the dispersion 
relation uj — uj(k)^ relating frequency and wave vector. From this relation 
the group and phase velocities of a wave can be determined. The dispersion 
relation contains the physical parameters of the medium under consideration. 
If the plot of the dispersion relation shows an asymptotic behavior towards 
a certain frequency c^res, as depicted in Fig. 4.1, there is a resonance at this 
frequency: as duj/dk converges towards zero, the wave no longer propagates 
and all the wave energy is fed into stationary oscillations. 



4.1.2 Linearization of the Equations; Perturbation Theory 

A wave is a disturbance of the medium with a certain speed, amplitude, and 
frequency. Thus the parameters of the medium, such as pressure, density, and 
the electromagnetic field, can be described by an average state, indicated by 
the index “0”, and a superimposed disturbance, indexed as “1”: 

B — Bq + Bi , E = Eq + El , u = uq 4 - ui , ( 4 - 4 ) 

3 = jo +ji , Q= Qo + 01 , P = Po+Pi , (4.5) 

with {ui) = {ji) = (El) = (Bi) — (qi) = (pi) = 0. The resulting MHD 
equations (3.41)-(3.48) are difficult to solve. If we limit ourselves to small 
disturbances, i.e. u\ < uq, Bi < Bq, Ei < Eq, ji < Jq, qi < Qq, and 
Pi < Po? we can derive two sets of equations which are more convenient: 
the set describing the equilibrium state of the undisturbed medium contains 
quantities with index “0” only, and a second set for the fluctuating quantities 
contains fluctuating quantities and products of undisturbed and fiuctuating 
quantities. Products of fiuctuating quantities are ignored because they are 
small in second order. 



4.1.3 Reynolds Axioms 

We have decomposed the instantaneous quantities x into an average Xq and 
a fluctuating part Xi. By definition, the average of x is (x) = Xq and the 




92 



4 Plasma Waves 



average of the fluctuations Xi vanishes: (xi) = 0. All our MHD equations 
(3.41)-(3.48) are given for the instantaneous quantities. To derive equations 
for average quantities, the instantaneous quantities have to be expressed in 
terms of averages and fluctuating parts, and the resulting equation has to be 
averaged. In doing this, some general rules must be obeyed, described by the 
Reynolds axioms: 

1. The average of the sum of two instantaneous quantities equals the sum 
of the averages: 



{A + B) = {A) + {B) = Ao + Bo. (4.6) 



2. The average of the product of an average quantity and a fluctuating one 
vanishes: 



(Aq Bi) = {Aq) {Bi) = Ao 0 = 0 , 

{Aq • Bi) = (Aq) • {Bi) 1= Ao • 0 = 0 , 

(Ao X Bi) = (Ao) X (Bi) = Aox0 = 0 . (4.7) 

3. The average of the product of two averages is the product of the averages: 

((A) {B)) = {A) {B) = AoBo. 

((A) {B)) = {A)^{B) = Ao-Bo. 

{ (A) X (^) ) ^ (A) X (B) = AoX Bo. (4.8) 

4. The average of the product of two instantaneous quantities equals the 
product of the averages plus the average of the product of the fluctuating 
quantities: 

(AB) = ((Aq + Ai) (^0 + -^i)) — {AqBq + AqBi + AiBq + AiBi) 

= {AqBo) -h {AqBi) + (AiBq) + (AiBi) = AqBq + (AiBi) , 
(A.B)=Ao-^o + (Ai.Bi), 

{A X B) ^ Aq X Bo A (Ai x Bi) . (4.9) 

The last term contains the average of the product of the fluctuating 
quantities. This is called the covariance or the correlation product. Thus 
we also have a definition for the correlation product of two quantities x 
and y : 



{xiVi) = {{x - xi){y - yi)) - xoyo , 

{xi ■ yi) = {{x - xi) ■ (y - yi)) -xo-yo , 

{xi X yi) = {{x - xi) X (y - yi)) - xq x yo . (4.10) 

5. The average of the derivative of an instantaneous quantity equals the 
derivative of its average: 

/aA\ _ d{A) _ dAo 

\dc /- dc ~ dc - 



(4.11) 
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6. The average of the integral of an instantaneous quantity is the integral 
of the average: 



(^1 Adcj = J{A)dC = I AodC. (4.12) 

4.1.4 Linearized MHD Equations 

The MHD equations (3.41)-(3.48) for the undisturbed quantities are 



V X Bo = ^o3o , 


(4.13) 


V X £?o = 0 , 


(4.14) 


<1 

to 

o 

II 

O 


(4.15) 


— — Eo + Mo X Bq , 
a 


(4.16) 


Qo {uo ■ V)«o = - Vpo + jo X Bo , 


(4.17) 


V • {qoUo) = 0 , 


(4.18) 


Po = Cg2" . 


(4.19) 


Note that in the equation of motion friction and gravity have been ignored. 
The equation of state describes the plasma as an ideal gas with all changes 
occurring adiabatically. 

The undisturbed medium is assumed to be homogeneous in pressure, den- 
sity, and magnetic held. Since it is assumed to be at rest {uq = 0), the current 
jo vanishes (4.13) and the undisturbed electric held vanishes too (4.16). Note 
that this is just another expression for the high conductivity of the plasma 
(a -> oo). The equations for the fluctuations then read 


V X Bi = /ioJi , 


(4.20) 


^ ^ dBi 

= - ' 


(4.21) 


VB, = 0, 


(4.22) 


El — —Ui X Bq , 


(4.23) 


dui _ . _ 

^0-^ ^ -Vpi + ji X Bo , 


(4.24) 


^ = -V . (»u0 . 


(4.25) 


Pi Qi 

— = 7a— . 
Po Po 


(4.26) 



Now (4.20)-(4.26) is a homogeneous linear system of equations for the fluc- 
tuating quantities. Since neither time nor the spatial coordinate are explicit 
in one of the equations, the system can be solved by an exponential ansatz. 
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4.2 Magnetohydrodynamic Waves 

Magnetohydrodynamic (MHD) waves are low-frequency waves related to the 
motion of the plasma’s ion component. They can be understood intuitively 
from the concepts of magnetic pressure (Sect. 3.3.1) and magnetic tension 
(Sect. 3.3.2): in a magneto-sonic wave, compression of the field lines creates 
a magnetic pressure pulse which propagates perpendicular to the field in the 
same way an ordinary pressure pulse propagates through a gas in a sound 
wave. Magneto-sonic waves therefore are longitudinal waves: the disturbance 
is parallel to the propagation direction of the wave. The displacement of 
part of a field line in an Alfven wave is similar to plucking a string: magnetic 
tension, like the tension in a string, acts as a restoring force and a transversal 
wave propagates along the field line. 

Despite the simplicity and graphic quality of these concepts, we shall treat 
these waves formally in a concept which is useful for small disturbances: the 
linearization of the equations. The MHD waves provide just one example for 
this concept; it is applied in a more elaborate way also in the quasi-linear 
theory (QLT) of wave-particle interaction (Sect. 7.3.4). 



4.2.1 Alfven Waves 

Alfven waves are transversal waves propagating parallel to the magnetic field 
(see Fig. 4.2). The magnetic tension acts as the restoring force. The fiuctu- 
ating quantities are the electromagnetic field and the current density. 

To derive the properties of Alfven waves, we have to solve the equations 
for the fiuctuating quantities. Let us start with the equation of motion (4.24). 
The pressure gradient force vanishes because the Alfven wave is limited to 
fluctuations in the magnetic field but not in the gas-dynamic pressure. If we 
express the current ji by Ampere’s law (4.20), the momentum balance reads 



(a7 



(e) 






(f) 





Fig. 4.2. Alfven waves depicted as an oscil- 
lating string or an elastic rope 
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o -j 

= 31 X Bi = —(V X Bi) X Bi . (4.27) 

at fiQ 

Combining Ohm’s law (4.23) with Faraday’s law (4.21) yields 

^ = V X (txi X Bo) . (4.28) 

The remaining equation is Gauss’s law for the magnetic field (4.22): 

V • JBi = 0 . (4.29) 

The equation of state (4.25) is not considered here because we are not con- 
cerned with fiuctuations in pressure and density, only in field. 

Equations (4.27)-(4.29) can be solved by means of a Fourier transforma- 
tion. If we assume that the solutions are plane waves, temporal and spatial 
derivatives can be substituted according to 

0/ dt — y — iuj , V — y ik , V* — y ifc* , V x — y ik x . (4.30) 

Equations (4.27)-(4.29) then read 

—iujQoUi = — (fe X Bi) X Bo , (4-31) 

Mo 



iouBi =ik X {ui X Bq) , (4.32) 

fe • J5i = 0 . (4.33) 

Alfven waves propagate along the field, thus fc||Bo- With V • Bq = 0 the 
above equations can be reduced to 

ujui = -^—{Bo • Bi)k , (4.34) 

Mo^o 

luBi = {k-Ui)Bo , (4.35) 

k‘Bi=0 . (4.36) 

Let us now multiply (4.34) by k and (4.35) by Bq- Adding both equations 
gives the dispersion relation for the Alfven wave: 

Mo^o 



Alfven waves are non-dispersive waves. Thus group and phase speed are the 



same. This Alfven speed 



VA = 




(4.38) 



is an important characteristic of a plasma: it is the maximum speed of a 
disturbance propagating along the magnetic field and can be compared with 
the speed of sound in a gas: if a disturbance propagates faster than the Alfven 
speed, a shock wave develops (Sect. 6.8). Typical Alfven speeds are some tens 
of kilometers per second in the interplanetary medium and some 100 km/s 
in the solar corona. 
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Example 1 5. Some average parameters of the solar wind are a number density 
of 8 cm“^, a magnetic field B = 7 nT and a temperature of about 2 x 10^ K. 
The density of the solar wind is then q = nrrip and the Alfven speed is 






(7 nT)2 



47t X 10“'^ (V s/A m) 



X 8 X 1.673 X 10-27 



= 54 km/s . 



(4.39) 

Here we have used the fact that the unit T can be expressed as V s/m^ and 
that the product of the electrical units volt and ampere gives the watt, which 
can easily be expressed as a mechanical unit. Table A.4.3 provides some help 
on units expressed in different forms. □ 



4.2.2 Magneto-Sonic Waves 



A magneto-sonic wave is similar to a sound wave: it is a longitudinal wave 
parallel to the magnetic field with alternating regions of compression and 
rarefaction in both the plasma and in the magnetic field (see Fig. 4.3). 

Since we allow for a compression of the plasma, we also have to consider 
the equation of state pi = 7a^iPo/^o = (4.26), with 



Vs = 




(4.40) 



being the (adiabatic) sound speed. To solve the equation of motion (4.24), 
we have to express pi and ji x Bi by Ui. 

Fourier transformation of the equation of continuity gives icogi = ikgoUi 
or, combined with the equation of state and (4.40), 

Pi = r^ikgoui : (4.41) 

lUJ 

the fluctuating pressure pi in the momentum balance can be expressed by 

Ui. 

Ohm’s law (4.23) yields the dependence of Ei on ui. Combined with 
Faraday’s law (4.21), Bi can be expressed as a function of Ui: 



V X El = -V X {ui X Bo) = - 



dBi 

dt 



(4.42) 






Fig. 4.3. Sketch of a magneto-sonic wave: 
undisturbed field (left) and fluctuating field 
(right) 
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Transformation yields 



ife X {ui X Bq) = iuBi . (4.43) 

This expression for Bi can be inserted into Ampere’s law (4.20): 

Moil = -ifc ^ (“ ^ “1 ^ • (4.44) 

With (A. 26), this can be simplified to 

= k X ui - k X Bq . (4.45) 



Here we have used k ± Bq and therefore k • Bq = 0 and k x Bq = kB^. The 
vector product of (4.45) and .Bq yields 



jl X Bq 



k^B^ui . 



(4.46) 



Thus the equation of motion (4.24) combined with (4.41) 
written as 



2 

— io;^o^i = ui — 

UJ 



B^ie 

ui 

llQUJ 



and (4.46) can be 



(4.47) 



giving the dispersion relation for the magneto-sonic wave. The phase speed 
is determined by the squared sound and Alfven speeds: 



V 



2 

ms 






= 4 • 



(4.48) 



It is independent of frequency or wave number: the wave is dispersion- free. If 
the magnetic field vanishes, the Alfven speed approaches zero and the phase 
speed of the magneto-sonic wave converges towards the speed of sound. This 
wave is called the slow magneto-sonic wave. If the magnetic field is strong, the 
phase speed of the magneto-sonic wave becomes the Alfven speed. But the 
wave still behaves differently because it propagates perpendicular to the field 
instead of parallel to it. This latter kind of wave is also called the compressive 
Alfven wave. Since the phase speed of the magneto-sonic wave exceeds the 
Alfven speed, it often is called the fast magneto-sonic wave or just the fast 
MHD wave. 



Example 16. Let us briefiy return to example 15, where we have already de- 
termined the Alfven speed in the solar wind. To determine the speed of the 
magneto-sonic wave, we also need the sound speed. From (4.40) we obtain 
= \/ 7 a^A:BT/(nmp) = — 70 km/s. For the speed of the 

magneto-sonic wave, we obtain ^’^ls = 88.7 km/s from (4.48). □ 
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4.2.3 MHD Waves Oblique to the Field 

So far, we have considered two special geometries: a transversal wave propa- 
gating parallel to the field and a longitudinal wave propagating perpendicular 
to it. But MHD waves can propagate at any angle relative to the field. For 
this general case, we have to solve the whole set of equations (4.13)-(4.19) for 
the undisturbed and (4.20)-(4.26) for the fiuctuating quantities; neither one 
of the equations nor a term in one of them can be ignored. Again, a solution 
can be obtained by combining the equations into one equation for the desired 
quantity, the speed of the fiuctuations. The dispersion relation obtained from 
this equation yields solutions for the above wave types with phase speeds 
depending on the angle Q between the wave vector and the undisturbed field. 
The speed of the MHD wave is determined by 

— (v\ -h -h v\v\ cos^ 9 = 0. (4.49) 

For 9 = 90°, the fast magneto-sonic wave (4.48) results. For 0 = 0°, two 
solutions exist: the Alfven wave with u = va and the sound wave with u = Vs. 

The different solutions can be represented in a hodograph or Friedrichs 
diagram (see Fig. 4.4). The hodograph is a polar diagram of velocities with 
the polar angle 9 relative to the undisturbed magnetic field direction and 
the wave’s phase speed as distance from the origin. The wave front then 
propagates perpendicular to the velocity vector. Depending on whether the 
Alfven speed is greater or less than the sound speed, two sets of solutions 
arise (see Fig. 4.4). For almost any direction we find three different phase 
speeds, corresponding to an Alfven wave and a slow and a fast mode magneto- 
sonic wave. For 9 = 90° or 270° only the fast magneto-sonic wave exists. As 
can be seen from the hodographs, the Alfven speed ua = v a cos 9 always 





Fig. 4.4. Friedrichs diagram (hodograph) representing the different types of MHD 
waves for Alfven speeds greater than the speed of sound (a) or less (b) 
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is between the phase speeds of the fast (outer curve) and the slow (inner 
curve) magneto-sonic wave. For waves propagating parallel to the field, the 
Alfven speed becomes identical to the fast magneto-sonic speed for va > Vs- 
This does not contradict (4.48) because the latter had been derived for waves 
propagating perpendicular to the field, the direction in which the magneto- 
sonic speed becomes maximal. The speeds of the Alfven wave and the slow 
mode magneto- acoustic wave, on the other hand, are maximal in the direction 
parallel to the field. The Alfven speed of waves parallel to the field equals 
the slow mode speed for va < Vs- 



4.3 Electrostatic Waves in Non-magnetic Plasmas 

Electrostatic waves start from a charge imbalance in an initially quasi-neutral 
fiuid element. This charge imbalance accelerates the electrons and ions in its 
neighborhood, resulting in charges oscillating back and forth. Since these 
oscillations only involve the electric field, they are defined as electrostatic 
waves. The oscillating magnetic field is zero. Electrostatic waves can occur 
in non-magnetic plasmas (this section) or in magnetized plasma (Sect. 4.4). 
Fourier transformation of Faraday’s law (4.21) for the fiuctuating quantities 
of the wave yields ifc x JEi == iuBi = 0, thus the fiuctuating electric field is 
parallel to the wave vector k. 

4.3.1 Plasma Oscillations 

Plasma oscillations, also called Langmuir oscillations, are a prime example of 
a plasma phenomenon that requires consideration of both types of charges. 
Nonetheless, this does not automatically imply that plasma oscillations can 
be described only in the framework of two-fluid theory. Since the ions are 
assumed to be stationary, we do not have to consider their equation of motion. 
Thus a one-fluid description of the electrons is sufficient as long as the electric 
field created by the ions is considered. 

Let us assume that ions and electrons are distributed equally in space. 
Thus quasi-neutrality is fulfilled even in rather small volumes. In addition, 
we assume that the thermal motion of the particles vanishes; the fluid is 
treated as a cold plasma. Such a fluid can be disturbed by the displacement 
of part of the electrons, as indicated in Fig. 4.5. This displacement creates an 
electric field that pulls the electrons back to their initial rest positions while 
the heavier ions stay in their positions. Since the electrons are accelerated 
along the field, they gain kinetic energy which in turn drives them behind 
their initial position, creating an electric field in the opposite direction. This 
field slows down the electrons, eventually driving them back. The period of 
the resulting electron oscillation around their rest position is the electron 
plasma frequency cjep. It can be derived from the equation of motion 
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Fig. 4.5. The displacement of electrons in a cold plasma leads to plasma oscillations 



rue-^ = -eE , (4.50) 

where x is the direction parallel to the electric field. E can be determined 
by applying Gauss’s law to a closed surface along the boundary between the 
positive and negative charges, and extending at least for the displacement x 
of the charges: E = 47rne ex/so- The equation of motion then is 



d^x 

dt‘^ eoTTle^ 



(4.51) 



This second order differential equation describes a wave and can be solved 
with an ansatz x = Xoe ^^^ . It describes a harmonic oscillator with the electron 
plasma frequency 









Some 



(4.52) 



The electrons therefore oscillate with a frequency which neither depends on 
the wave-length nor on the amplitude of the disturbance responsible for their 
initial displacement. 

Plasma oscillations are an important tool in plasma diagnostics because 
they allow us to measure the electron density Uq. 



Example 17. Solar radio bursts (Sect. 6.7.1) are an important tool for the di- 
agnosis of coronal disturbances. For the lower corona, we can use a simplified 
density model n ~ noexp(-r/ro) with a scale height of ro = O.lrQ and a 
density no = 10^^ m^ at r = Ir© (more correctly, this applies at the base of 
the corona 2000 km above the solar surface, but for the numerical exercise 
this 2000 km can be ignored). The radio emission observed from the ground 
is in the frequency range 10 MHz to about 200 MHz. According to (4.52), 
this corresponds to coronal heights between 2ro and O.Sro above the surface. 
The main mechanisms for the generation of solar radio bursts are electron 
beams with speeds of about c/3 (type III burst) and coronal shock waves 
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with speeds of the order of 1000 km/s (type II bursts). The height range 
over which these exciters propagate is 1.2ro = 0 . 12 rQ 8.4 x 10^ km. The 

electrons with speed c/3 travel this distance within about 0.3 s, and thus 
the type III burst shows an extremely fast frequency drift of the order of 
750 MHz/s. The shock wave, on the other hand, needs almost 9 s to travel 
this distance, corresponding to an average frequency drift of about 20 MHz/s. 
Thus both types of bursts can be easily identified by their frequency drift in 
radio spectrograms. □ 



4.3.2 Electron Plasma Waves (Langmuir Waves) 



Plasma oscillations have been derived for a cold plasma: the group velocity 
equals zero and the disturbance does not propagate. In a warm plasma, the 
situation is different. The thermal motion of the electrons carries information 
about a disturbance into the undisturbed ambient plasma. The disturbance 
then propagates as a wave. Formally, one can derive the dispersion relation 
for this wave by adding the pressure gradient force — Vp to the equation of 
motion (4.50). If the plasma behaves adiabatically, the dispersion relation for 
the plasma wave in the one-dimensional case ( 7 a — 3) reads 



2 2 , 3/22 

^ = ‘^pe + ^th . 


(4.53) 


or in the three-dimensional case 




2 2 1 5 7 2 2 

^ = ^pe + . 


(4.54) 



with the thermal velocity Vth. = y/2fcBT/me (see Fig. 4.6). Both equations are 
based on the assumption that locally a Maxwell equilibrium is established. 
Thus the plasma must allow for frequent collisions. In space plasmas, in 
general this is not the case. Here the correct dispersion relation is 



= <^pe + • (4-55) 

This equation is called the Bohm-Gross equation [53]. From (4.55) the group 
velocity of plasma waves can be determined as 



_ ^ 2 _ ^ 
^ dk w i;ph 



(4.56) 




Fig. 4.6. Dispersion rela- 
tion for electrostatic elec- 
tron waves in a warm un- 
magnetized plasma {Lang- 
muir waves) 
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As expected, the group velocity vanishes as the thermal energy of the plasma 
approaches zero. The group velocity always is significantly smaller than the 
thermal speed and thus also much smaller than the speed of light. For large 
wave numbers fc, the information propagates with the thermal velocity. For 
small wave numbers, the information travels slower than i;th because the 
density gradient decreases for large wavelengths and therefore the net flux of 
momentum into adjacent layers becomes small. 



4.3.3 Ion- Acoustic Waves (Ion Waves) 



In Langmuir waves, as in plasma oscillations, the ions are assumed to be in- 
definitely massive; they stay fixed at their position. Langmuir waves thus are 
high-frequency waves. If we allow for ion motion, the properties of the wave 
change. The inertia of the ions requires rather slow oscillations. Therefore, ion 
waves are low-frequency waves. In ordinary fluids, sound waves are the coun- 
terpart of the ion wave. They can be derived from the Navier-Stokes equation 
with the pressure gradient force being the only term on the right-hand side. 
The dispersion relation than reads Vs = ujfk y/laJPo! 
with Vs being the speed of sound. 

Sound waves are pressure waves. They transport momentum from one 
layer to the next due to collisions between molecules or atoms. Despite its 
often low density, in a plasma a similar phenomenon exists. Here the mo- 
mentum is transported by Coulomb collisions; thus information is contained 
in the charges and the fields. Since we have to consider the motion of both 
electrons and ions, the ion wave can be derived in the framework of two-fluid 
theory only. In the equation of motion, we have to consider the pressure gra- 
dient force and the force exerted by the electromagnetic field. The dispersion 
relation than can be derived as 



^ ^ lekB,Te + 
k‘^ rrii 



(4.57) 



Note that 7 e ~ 1 because the electrons are fast compared with the waves and 
an isothermal distribution is established easily. The ions, on the other hand, 
experience a one-dimensional compression in the plane wave, and 71 = 3. The 
group velocity of the ion wave is independent of the wave number k. 

If the ion population can be considered as cold, T[ -> 0, and the wave 
length is small, fcAo ^ 1 , the ions can oscillate with the ion plasma frequency 



2 2 

U) = U) • = 



n 






sorrii 



(4.58) 



The dispersion relation for ion waves is shown in Fig. 4.7. The ion plasma 
frequency as asymptote for short wavelengths is indicated in the right part 
of the figure. There are fundamental differences in the dispersion relation 
for electron and ion waves. Electron plasma waves are waves with constant 
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Fig. 4.7. Dispersion rela- 
tion for ion waves in a warm 
unmagnetized plasma. The 
dashed lines are asymptotes for 
small and large wave numbers 



frequency (see Fig. 4.6), the thermal motion only adds a small correction. 
In contrast, ion waves are waves with a constant speed and require thermal 
motion. The phase speed and group speed are identical. The difference be- 
tween these two types of waves is due to the behavior of the second particle 
component: in the electron plasma wave, the electrons oscillate while the ions 
stay fixed. In the ion wave, the ions oscillate but the electrons are in motion 
too. In particular, they can be carried along by the ions, screening part of 
the electric field resulting from the ion oscillation. 

In space plasmas, ion acoustic waves are observed upstream of planetary 
bow shocks where they are generated by suprathermal particles streaming 
away from the shock front (Sect. 7.6.4). 

Example 18. The proton density upstream of a shock is ni = 8 cm“^, and for 
the proton Z{ = 1. Ion acoustic waves (upstream waves) excited by particles 
streaming away from the shock then have a ion acoustic frequency == 
3.7 X 10^ Hz. □ 



4.4 Electrostatic Waves in Magnetized Plasmas 

Let us now consider electrostatic waves in a magnetized plasma. They can 
be divided into waves with k parallel or perpendicular to Bq. The terms 
longitudinal or transversal refer to the direction of the wave vector fc relative 
to the fluctuating electric field E\. Only longitudinal waves are electrostatic 
because k x El = uoBi vanishes. In a transverse wave, B± is finite and the 
wave is an electromagnetic one. Waves propagating oblique to the field can 
be regarded as superposition of longitudinal and transversal waves. 

4.4.1 Electron Oscillations Perpendicular to B 
(Upper Hybrid Frequency) 

As for electron oscillations in an unmagnetized plasma, in the upper hybrid 
oscillations the ions stay fixed in space, creating a positively charged, uniform 
background. The plasma is cold; the thermal motion of the electrons can be 
ignored. The equation of motion then contains only the forces exerted by the 
electric and magnetic fields. 
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For longitudinal waves, the dispersion relation reads 

^uh = ^pe + <^ce ^ (4.59) 

with cjpe being the frequency of electron plasma oscillations and cjce being 
the electron cyclotron frequency. Only electrostatic waves perpendicular to 
B have this upper hybrid frequency. Disturbances parallel to B oscillate with 
the plasma frequency cjpe- particles moving parallel to the magnetic field do 
not gyrate and therefore ooce vanishes. 

We can understand these waves as the superposition of two motions. In 
the plane wave, the electrons exhibit regions of compression and rarefaction 
as in ordinary plasma oscillations. The magnetic field, which is perpendicular 
to the direction of electron motion, forces the electrons into elliptical orbits 
instead of oscillations along a straight line. As in simple plasma oscillations, 
the electric field accelerates the electrons displaced from their rest position. 
As the electron speed increases, the Lorentz force exerted by the magnetic 
field increases, too, reversing the direction of electron motion. The electrons 
therefore move against the electric field, losing energy. Thus two restoring 
forces act on the electrons: the electrostatic force resulting from the electron 
displacement and the Lorentz force. This additional restoring force leads to a 
higher frequency than in simple plasma oscillations. If the magnetic field van- 
ishes, the cyclotron frequency vanishes, too, leaving us with ordinary plasma 
oscillations. If the plasma density decreases, the plasma frequency decreases, 
too. For vanishing plasma density, the remaining motion is a gyration around 
the magnetic field line. 

4.4.2 Electrostatic Ion Waves Perpendicular to B 
(Ion Cyclotron Waves) 

The upper hybrid wave is a high-frequency wave with ou much larger than 
both the plasma and cyclotron frequencies. It results from the motion of 
electrons in a magnetized plasma. Electrostatic ion waves, like ion acoustic 
waves, are low-frequency waves. Let us now consider an ion acoustic wave 
with k nearly perpendicular to J3. In the equation of motion only the forces 
of the electromagnetic field are considered. The dispersion relation then is 

= ^ci + > (4-60) 

with Uci being the ion cyclotron frequency. An electrostatic ion wave parallel 
to the magnetic field has the same frequency == k‘^vl as an ion acoustic 
wave because the ions do not gyrate around the field and c^ci vanishes. 

The physical explanation is the same as in the upper hybrid wave: the 
Lorentz force provides an additional restoring force leading to an elliptical 
path and a higher frequency, described by the additional term Uci in (4.60). 
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4.4.3 Lower Hybrid Frequency 

In the ion cyclotron wave we have assumed k to be nearly perpendicular 
to B. The small remaining component of the wave vector and therefore the 
particle motion parallel to B is essential because it allows the electrons to 
travel freely along B to obtain thermal equilibrium as was required in the 
derivation of the ion acoustic wave. If this cannot be archived, the equation 
of motion for the electrons must be solved differently, though that for the 
ions still is valid. In this case, the lower hybrid wave results with 

Ct^lh — y/ (^ce ^ci • 

The motion is maintained in the perpendicular direction only. Thus charge 
neutrality can be maintained in that direction. Prom this charge neutrality 
we can understand the frequency given in (4.61). The fluctuating electric field 
El of the wave is perpendicular to Bq. The ions move along Ei while the 
Lorentz force acting on them is rather small. The ion displacement along Ei 
is limited because the electric field oscillates; the maximum displacement is 
about Axi = eEijm\uj\. The electrons gyrate around the magnetic field. In 
addition, they experience an FJ x H drift perpendicular to both fields. Their 
displacement parallel to E\ is given roughly as Axe = Ei/BoUce. Charge 
neutrality requires Axe == Axi and therefore u = uj\\y. 

Lower hybrid waves are of great importance in the auroral regions where 
they may be responsible for ion heating. 



4.5 Electromagnetic Waves in Non-magnetized Plasmas 

Electromagnetic waves consist of both a fluctuating electric and a fluctuating 
magnetic field. In this section we shall consider electromagnetic waves in 
an unmagnetized plasma; thus the background field Bq vanishes and then 
B = B\. Electromagnetic waves are high-frequency waves. Thus because of 
their large inertia the ions do not follow the fluctuating field. Electromagnetic 
waves therefore can be treated within the framework of one-fluid theory. In 
the equation of motion we have to consider the pressure gradient force and 
the forces exerted by the electromagnetic field. The dispersion relation for 
electromagnetic waves then reads 

= + (4.62) 

where c—\j is the speed of light in vacuum. 

The dispersion relation (4.62) is shown in Fig. 4.8. For plasma frequencies 
much smaller than the frequency of the waves we get light waves with uj = kc. 
The index of refraction for these waves is n = = ck/uj or, taking into 

consideration the electron plasma frequency. 
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Fig. 4.8. Dispersion relation for electromag- 
netic waves in a cold unmagnetized plasma. 
For small wave numbers the group velocity 
approaches zero and a plasma oscillation re- 
sults. For large wave numbers both the group 
and phase speeds converge towards the speed 
of light 




(4.63) 



Waves can propagate through a medium only if v? is larger than zero. 
Thus electromagnetic waves can exist only if cj > For transverse electro- 
magnetic waves, the ordinary light waves, the cutoff frequency is the electron 
plasma frequency, the plasma is opaque at lower frequencies. 

For (jj < ctJpe an imaginary refraction index results. For a real frequency 
an imaginary wave vector k would result. Such a wave would not propagate 
but decay. The second solution, — ifc, formally could be interpreted as wave 
growth. Physically, however, this cannot happen: because we are considering 
a cold, stationary plasma, the energy required for wave growth cannot be 
drawn from the plasma. The cold plasma only can absorb the energy of a 
decaying wave and cannot support wave growth. 



Example 19. Electromagnetic waves can be used for plasma diagnostics in 
space or in the ionosphere. For instance, the density of a space plasma can be 
determined by detecting the radio signal of a satellite from another satellite 
or a ground station. As the frequency of the radio signal is varied, absorption 
sets in at the plasma frequency of the medium; thus the density can be 
determined. 

Another application is the measurement of the electron density in the 
Earth’s ionosphere. Radio waves of variable frequency are sent from a trans- 
mitter A to a receiver B, see Fig. 4.9. As the wave enters the ionosphere, it 
is refracted according to Snell’s law. Thus from the travel time of a radio 
pulse between transmitter and receiver we can determine the height at which 
the ray path is bent down towards Earth again. Prom the geometry, we also 




Fig. 4.9. The electron density in the 
ionosphere can be determined from the 
reflection of electromagnetic waves 
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can determine the angle at which the electromagnetic wave enters the iono- 
sphere and, from Snell’s law, the refraction index. The latter gives us the 
plasma frequency and also the electron density. Ionospheric reflection is used 
in short-wave communications because it allows us to send signals around 
the Earth. With a maximum ionospheric electron density of about 10^^ m“^, 
the critical frequency for reflection is about 10 MHz. 

The experiment sketched in Fig. 4.9 can be simplified to a vertical geom- 
etry in which the transmitter and receiver are located at the same position. 
Total reflection of the pulse then occurs when the emitted frequency equals 
the local plasma frequency, i.e. n — ^. The travel time of the signal gives the 
height of reflection. Emitting at different frequencies over a broad frequency 
band, the height profile of the electrons in the ionosphere can be determined. 
In principle, this sounding experiment is very simple because reflection occurs 
exactly at the point where the signal frequency is equal to the local plasma 
frequency. But a small error remains: the electron density varies continu- 
ously with height, leading to changes in the propagation speed of the wave. 
Therefore, the estimate of the reflection height from the propagation time 
is not exact. In reality, the situation becomes even more complex because 
the electromagnetic wave propagates into a magnetized plasma and is split 
into ordinary and extraordinary modes, both having different propagation 
speeds. □ 

Example 20. The electron density in the quiet ionosphere at 120 km height is 
about 2 X 10^ cm“^ (see Fig. 8.18). Reflection occurs for n = 0, that is, when 
the local plasma frequency cjpe equals the wave frequency. From (4.52) we 
find /pe = 2.4 MHz. During a sudden ionospheric disturbance, the electron 
density is increased by a factor of 3. Now the critical frequency for reflection 
at 120 km height becomes 7 MHz. The 2.4 MHz wave reflected originally at 
this height is now reflected at a lower height, and consequently its range of 
propagation is reduced and the signal is not received where it was supposed 
to be received: communication is inhibited. □ 



4.6 Electromagnetic Waves in Magnetized Plasmas 

As in the case of electrostatic waves, electromagnetic waves in magnetized 
plasmas are characterized by the direction of the wave vector relative to the 
background magnetic field Bq and the fluctuating electric field E\. 

4.6.1 Electromagnetic Waves Perpendicular to Bq 

Let us start with an electromagnetic wave perpendicular to the undisturbed 
magnetic field. The wave is assumed to be transversal, and thus k 1. E\. 
Two modes are possible: either the fluctuating electric field is parallel to Bq 
(ordinary wave) or it is perpendicular to Bq (extraordinary wave). 
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Ordinary Waves {Ei\\Bo)> For Ei\\Bq the wave equation takes the same 
form as in an unmagnetized plasma, leading to the dispersion relation 

— ^pe + • (4.64) 

The propagation of ordinary or 0-waves therefore is not influenced by the 
magnetic held: in an ordinary wave, the fluctuating electric fleld is parallel to 
.Bo and therefore the magnetic fleld does not influence the wave dynamics. 



Extraordinary Waves (Bi ± Bq). If the fluctuating electric fleld is per- 
pendicular to Bo, the electron motion is influenced by the magnetic fleld and 
the dispersion relation has to be modifled accordingly. As the wave propagates 
perpendicular to Bo and Ei oscillates perpendicular to Bo, the wave vector 
k has components parallel and perpendicular to Bi. The wave therefore is a 
mixed mode of both a transversal and a longitudinal component. While the 
ordinary wave is linearly polarized (Bi only along Bo), the extraordinary 
wave is elliptically polarized. The dispersion relation for the extraordinary 
wave can be written as 






. ,2 . ,2 _ . ,2 

^pe ^ ^pe 2 

— ^ = n 






(Jj^ UJ 



2 _ 






uh 



(4.65) 



The influence of the magnetic fleld is contained in the upper hybrid frequency 
(4.59). The dispersion relation for both ordinary and extraordinary electro- 
magnetic waves is shown in Fig. 4.10. The hatched area indicates the stop 
band separating two different modes of the extraordinary or X-wave. 

A closer look at the refraction index (4.65) helps us to understand these 
two modes. The refraction index becomes zero for 



^^,R) “ y ^pe + > (4.66) 

with ‘R’ and ‘L’ indicating right-hand and left-hand polarized waves. 

The frequencies deflned by (4.66) are cutoff frequencies: for lower fre- 
quencies the refraction index becomes imaginary and the wave vanishes. The 
ordinary wave has its cutoff at the electron plasma frequency. The refraction 
index also can go towards inflnity. This happens for very large wave numbers 
(or very small wavelengths). The corresponding frequency can be found by 
letting k go towards inflnity. This occurs when uj goes towards the upper hy- 
brid frequency (Juh in the denominator of (4.65). As the extraordinary wave 
approaches this resonance, its phase and group velocities approach zero and 
the electromagnetic energy is converted into electrostatic oscillations. There- 
fore, in Fig. 4.10 between the upper hybrid frequency cjuh and the cutoff 
frequency ur a stop band results where, because n < 0, the extraordinary 
wave cannot propagate. 

The existence of a stop band has an interesting application. Magnetized 
plasmas are emitters of radio waves. Therefore, all planets emit radio sig- 
nals. In the solar system, the largest radio source is Jupiter. While it is easy 
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Fig. 4.10. Dispersion relation for or- 
dinary (O) and extraordinary (X) elec- 
tromagnetic waves in a cold magnetized 
plasma. The hatched region between ujuh 
and Ur separates the two modes of the ex- 
traordinary electromagnetic wave 



to detect these waves, their interpretation in terms of sources and excitation 
mechanisms is difficult. The stop band, however, allows us to exclude one pos- 
sible mechanism. The radio waves are excited close to the planet where both 
the electron plasma frequency as well as the electron cyclotron frequency are 
large. As the waves propagate outward, their frequency decreases. Extraor- 
dinary waves excited at a frequency below the upper hybrid frequency will 
encounter a stop band where cJuh has decreased below the frequency of the 
wave. These waves cannot escape from the vicinity of the planet. Planetary 
radio waves observed at large distances therefore have not been excited as 
extraordinary waves below the local upper hybrid frequency. 



4.6.2 Waves Parallel to the Magnetic Field: 
Whistler (R- Waves) and L- Waves 



Let us now consider electromagnetic waves propagating parallel to jBq- These 
waves are of particular importance in the magnetosphere because they can 
propagate along B towards the ground. Again, two cases can be distinguished: 
the right-hand and the left-hand waves. The dispersion relation becomes 






u 



2 

pe 




and the refraction index is 

2 _ C^/C^ _ 

^ l±(jJce/^^ 



(4.67) 



(4.68) 



In both equations the sign refers to the right-hand or R-wave. The electric 
field of an R-wave rotates in the same way as an ordinary screw rotates: if 
the thumb of the right-hand points in the direction of fc, the curved fingers 
point in the rotation direction of the electric field. Thus the R-wave rotates in 
the same direction as an electron gyrates. The L-wave rotates in the opposite 
direction, following the ion gyration. Thus both waves have a resonance. The 
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R-wave has a resonance at a; = cjce- The electron gyrating around the mag- 
netic field then experiences a constant electric field which, depending on the 
phase between the electron and the field, either accelerates or decelerates it 
continuously. This resonance is called cyclotron resonance. At this resonance 
the phase velocity is zero and the wave does not propagate. The L-wave does 
not resonate with the electrons because the wave field rotates in the direction 
opposite to the electron gyration. Thus the L-wave has no resonance at high 
frequencies, and its resonance is lu = The cutoff frequencies for L- and 
R- waves are the same as for the extraordinary waves: 






cutoff 

(L,R) 





(4.69) 



The dispersion relation for electromagnetic waves propagating parallel to the 
magnetic field is shown in Fig. 4.11. Two cases can be distinguished: cjpe < ooce 
(right panel) and cjpe > coce (left panel). The main difference between these 
cases is the cutoff for the left-hand mode: for ujpe < the cutoff is below 
the electron cyclotron frequency, while in the opposite case it is above. For a 
constant cyclotron frequency these waves approach the = 1 line for high 
frequencies. The R-wave has a stop band between ujr and ujce- The R- waves 
in the lower frequency range also are called Whistler waves and are important 
for propagation studies in the magnetosphere. Whistler waves can be excited 
by lightning discharge. Thus the source has a short duration but creates a 
wide spectrum of different frequencies. Since the propagation time depends 
on the group speed, the first waves arriving at a distant observer have higher 
frequencies than those arriving later, rather like a whistle with decreasing 
pitch. The rate of frequency change contains information about the change 
in plasma density along the propagation path. 



(a) Ct^pe ^ (b) ^ce 





Fig. 4.11. Dispersion relation for electromagnetic waves propagating parallel to 
the magnetic field in a cold magnetized plasma 
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4.7 Summary 

Table 4.1 summarizes the waves discussed in this chapter. In electrostatic 
waves only the electric field fiuctuates while the magnetic field either is static 
or zero. In electromagnetic waves, both the electric and magnetic fields oscil- 
late. The waves are further divided into electron and ion waves. In electron 
waves the electrons oscillate while the ions create a uniform background. 
Therefore, electron waves are high-frequency waves. In ion waves, both ions 
and electrons oscillate. Because of the large inertia of the ions, ion waves are 
low-frequency waves. The propagation of the wave depends on the orientation 
of the wave vector k relative to the background magnetic field and relative 
to the fiuctuating electric field. 

We should note that this set of dispersion relations is greatly simplified 
in so far as it considers only waves propagating into the principal directions 
perpendicular or parallel to the field. Waves propagating oblique to the field 
are more difficult; however, often they can be understood as the superposition 
of the two modes parallel and perpendicular to the field. 



Table 4.1. Different types of plasma waves 



Wave 


Geometry 


Dispersion relation Equation 


Electron Waves (Electrostatic) 




Langmuir waves 


Bo = 0 
or fe||Bo 


= Wpe + 3A:^Wth/2 (4.55) 


Upper hybrid waves 


k±Bo 


Wuh = Wpe + (4.59) 


Ion Waves (Electrostatic) 




Ion acoustic waves 


Bo =0 
or fc||B 


up = k'^vl (4.57) 


Ion cyclotron waves 


k±Bo 


Up" = 4- Pvl (4.60) 


Lower hybrid waves 


K±Bo 


^Ih ~ ^ce (4-61) 


Electron Waves (Electromagnetic) 




Light waves 


Bo = 0 


up = Wpe + k'^P (4.62) 


0- waves 


k ± Bo, 
BillBo 


up = Pk^ + Wpe (4.64) 


X-waves 


k _L Bo, 


= PP+ 




El J_ Bo 


+u,l^{uj^ -u;l)/{uj^ -ujD (4.65) 


Whistler (R-waves) 


k\\Bo 


= PP - ulJ[\ - {uJcPuj)] (4.67) 


L- waves 


fc||Bo 


= PP + Wpe/[1 + (Wce/w)] (4.67) 


Ion Waves (Electromagnetic) 




Alfven waves 


k\\Bo 


Up = Pv\ (4.37) 


Magneto-sonic waves 


k ± Bo 


= PP [vl + vi)l{P + v%) (4.48) 
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Some of these waves can be used for plasma diagnostics, with the sources 
of the waves either being natural, such as in Whistler waves to study prop- 
agation in the magnetospheric plasma, or artificial, such as radio waves to 
probe the ionosphere. 



Exercises and Problems 

4 . 1 . Explain, in your own words, the important quantities characterizing a 
wave. What are group and phase speeds? 

4 . 2 . Show that in an electron plasma wave the energy contained in the elec- 
tron oscillation exceeds the energy in the ions by the mass ratio m\j{Z\m^. 

4 . 3 . On re-entry into the Earth’s atmosphere, a spacecraft experiences a radio 
blackout due to the shock developing in front of the spacecraft. Determine 
the electron density inside the shock if the transmitter works at 300 MHz. 

4 . 4 . Show that the maximum phase speed of a Whistler wave is at a frequency 

uj — Prove that this is below the speed of light. 

4 . 5 . Show that if a packet of Whistler waves with a spread in frequency 
is generated at a given instant, a distant observer will receive the higher 
frequencies earlier than the lower ones. 

4 . 6 . Show that if the finite mass of the ions is included, the frequency of 

Langmuir waves in a cold plasma is given by -h 

4 . 7 . How would you use pulse delay as a function of frequency to measure 
the average plasma density between the Earth and a distant pulsar? 

4 . 8 . Determine the Alfven speeds and the electron plasma frequencies for the 
situations described in Problem 3.12. 

4 . 9 . Use Fig. 4.4 to describe the properties of magnetohydrodynamic waves 
propagating parallel to ior va > Vs and va < Vq. 

4 . 10 . Show that in an Alfven wave the average kinetic energy equals the 
average magnetic energy. 

4 . 11 . Discuss an Alfven wave with k\\Bo. (a) Determine the dispersion rela- 
tion under the assumption of a high but finite conductivity (the displacement 
current nevertheless can be ignored), (b) Determine the real and the imagi- 
nary parts of the wave vector for a real frequency. 
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All animals are equal 
but some animals 
are more equal 
than others. 
G. Orwell, Animal Farm 



Magnetohydrodynamics has treated the plasma as a fluid with all particles 
moving at the same speed, the bulk speed. The thermal motion is ignored. 
Space plasmas often are hot plasmas with the thermal speed by far exceeding 
the flow speed. Thus thermal motion has to be taken into account and the 
plasma has to be described by a distribution function which considers the 
positions and velocities of the individual particles. The Boltzmann equation 
gives the equation of motion for this phase space density. 



5.1 The Distribution Function 

Kinetic theory starts from the physics of individual particles (the microscopic 
approach). The macroscopic phenomena then can be described by averaging 
over a sufficiently large number of particles, an approach which also is used 
in statistical mechanics. This formalism therefore is called the statistical de- 
scription of a plasma. 

5.1.1 Phase Space and Distribution Function 

The mechanical properties of each particle are described completely by its 
position and momentum. The phase space is a six-dimensional space deflned 
by the three spatial coordinates qi,q2,Qs and the three generalized momenta 
Pi^P2^P3‘ Each particle is related unambiguously to one point in phase space: 

Q = i<li,q2,q3-,Pi,P2,P3) = {q,p) ■ ( 5 . 1 ) 



The speed of the particle in phase space, i.e. the combined change in its 
position and momentum in ordinary three-dimensional space, then is 
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^ ^ ^ ^ ^ /. 
dt V di ’ dt ’ di ’ dt ’ di ’ di 7 V di ’ di 7 ' ^ 

For a plasma consisting of N particles, in phase space N such points exist. The 
phase space density f{q,p,t) gives the number of particles inside a volume 
element [{qi,qi + d^i), {pi^Pi + dp^)]. This density function is also called the 
distribution function of the plasma. It is related to the particle density in 
ordinary three-dimensional space by 

+c» 

n{q,t)= J f{q,p,t)d^p. (5.3) 

— CX) 

The number density is used in the definition of averages which describe the 
macroscopic properties of the plasma. If a(g, p, t) is a function in phase space, 
its average is defined as 

oo 

{a{q, t)) = j a{q, p, t) f{q, p, t) d^p . (5.4) 

— OO 

The average or bulk speed of the plasma, for instance, is given as 

oo 

u{q, t) = {v{q, t)) = j v{p, q, t) f{q, p, t) d^p . (5.5) 



Application of this averaging scheme to the equation of motion yields the 
Vlasov equation (5.23) which is the basic equation of statistical plasma 
physics and can be used to derive the MHD equations of two-fiuid theory. 



5.1.2 Maxwell’s Velocity Distribution 

The average speed (v) of the particles in a plasma, as defined by (5.5), is 
also the plasma fiow speed u. The speeds vi of individual particles, however, 
can be substantially different; in particular, speeds of individual particles can 
exceed the fiow speed by orders of magnitude. A plasma contains different 
particle species s which all have their own average speed Us . If in an electron- 
proton plasma the average speeds Uq and Up are different, a current results. 

To derive the velocity distribution of the particles, let us first determine 
the kinetic energy contained in a volume element of the plasma. The kinetic 
energy of the plasma fiow is determined by the average speed u, while the 
entire kinetic energy is the sum of the kinetic energies of all particles. Since 
the latter is larger, there is also kinetic energy contained in the stochastic 
motion of the particles, which can be described by 

/rn _ 2 \ _ Jm{v-uff{r,v,t)dv 

\ 2 / 2 J f{r,v,t)dv 



(5.6) 




5.1 The Distribution Function 115 



This “random” kinetic energy is related to the hydrostatic pressure by 



P 

n 



2 

N 





(5.7) 



where N is the number of degrees of freedom, normally three. 

If the system is in thermal equilibrium, which in a hot plasma is not neces- 
sarily the case, the velocity distribution is given by the Maxwell distribution: 



f{r,v,t) = n 




(5.8) 



where T is the plasma temperature and fee is the Boltzmann constant. Ac- 
cording to (5.8), the relative number of particles with large stochastic speeds 
\v — u\ increases with T. The distribution’s maximum is at the most probable 
thermal speed Vth'^ 



Vth 



2kBT 

m 



(5.9) 



In a one-atomic gas in equilibrium, the temperature is related to the 
kinetic energy of the stochastic motion by 



{^m{v - u)^) = ^NkBT . (5.10) 

In a plasma, N normally equals 3. Even in a magnetized plasma N equals 
3 because the particle speed is described completely by one of the triples 
'^x — Vy — and Vz — or ?;|| , |'v_l — n|, and ipy, the latter describing the 
direction of the perpendicular speed relative to the gyro-center. Note that 
the combination of (5.7) and (5.10) yields the ideal gas law p = nkBT. 

Occasionally, we are concerned only with particle speeds and not with the 
direction of motion. This might be the case if the plasma is at rest, i.e. u 
equals zero. The distribution function (5.8) then is 

ffnr. V, t) df2y di; = (47r/(r, \v\,t)v‘^) di; = g{r, v) d?; (5.11) 



where i? is the solid angle. Equation (5.11) gives the number of particles inside 
a volume element with speeds between v and v -h dt». The function g{r^v) 
gives the number of particles per velocity unit, again with speeds between 
V and V dv. For small speeds, this function increases with the square of v 
while for large speeds it decreases exponentially (see upper panel in Fig. 5.1). 
If not the speed, but only one component of the velocity is considered, the 
distribution is symmetric around zero if the plasma is at rest (middle) or 
symmetric around the flow speed in that particular direction if the plasma is 
in motion (lower panel). 
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Fig. 5.1. Maxwell distribution, (a) 
Distribution g{v,r) for the particle 
speeds in a plasma at rest, (b) Maxwell 
distribution of one velocity component 
for a plasma at rest, the other velocity 
components have been removed by in- 
tegration. (c) Same as above but for 
a plasma moving with average speed 
Ux in the x-direction. Note that an in- 
crease in temperature would not affect 
the position of the maximum in (b) and 
(c) but would shift the maximum to- 
wards the right in (a), combined with 
an increase of the maximum 



5.1.3 Other Distributions 



Not all particle distributions can be described by a Maxwellian. If the plasma 
is not in equilibrium, normally no analytical distribution function exists, al- 
though often distribution functions are reasonable approximations. 

For instance, if a plasma has a marked difference in the speeds parallel and 
perpendicular to the magnetic field, the distribution can be approximated by 
a bi-Maxwellian, that is the product of two Maxwellians, which take into 
account the different speeds and temperatures of the two motions: 



f{r,v,t) 



n 



r m(u||-U||)2 

X exp < - - — > exp 

2A:bT|| ^ 



m{v± — ux)^ 
fcsTx 



(5.12) 



The average kinetic energies parallel and perpendicular to the field are 

(|m(i;|| - U||)^) = and - ux)^) = A:bTx • (5.13) 

Here the difference in the number of degrees of freedom for the parallel and 
perpendicular motions becomes obvious: there is one degree of freedom par- 
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Fig. 5.2. Bi-Maxwellian distributions often found in space plasmas 



allel to the field while there are two perpendicular to it. Equation (5.13) can 
also be used to define the pressure perpendicular and parallel to B. 

The bi-Maxwellian (5.12) is a typical anisotropic distribution function 
of the form /(f_L,f||). This kind of distribution function is the one most 
commonly found in space plasmas. It is an essentially two-dimensional and 
gyrotropic velocity distribution: it does not depend on the phase angle of the 
gyromotion. Figure 5.2 shows contour-plots for different (bi-)Maxwellians: 
the isotropic distribution (left) is characterized by circular contours. In the 
anisotropic Maxwellian, the contours are deformed into elliptical shapes. In 
this example, the deformation is such that it corresponds to Ty > Tj_. During 
magnetic pumping (example 9) we have a continuous change from an isotropic 
to an anisotropic distribution and back. Special distributions arise if either 
or u± vanishes in (5.12). For instance, a plasma might drift perpendicular to 
the magnetic field, such as in the case of crossed electric and magnetic fields. 
Then u\\ vanishes and a drifting Maxwellian results (see Fig. 5.2). If the drift 
velocity is large compared with the thermal velocity, such a distribution is 
called a streaming distribution. If the distribution drifts along the magnetic 
field at speed ifc||, and u± vanishes, it is a parallel-beam distribution. Such 
distributions are frequently encountered in the auroral regions of the mag- 
netosphere and in the foreshock regions of planetary bow shocks. The last 
sketch in Fig. 5.2 illustrates a loss cone distribution, where particles with 
sufficiently small pitch angles are lost from the magnetospheric population. 

Occasionally, the particle distribution can be described by a Maxwellian 
up to a certain energy. At higher energies the particle distribution can be fit- 
ted much better by a power law than by the exponential decay of the Maxwell 
distribution. Here the kappa distribution, sometimes also called Lorentzian 
distribution. 






3 r 



1 + 



miv 





2kE'j' 



21 



-K—1 



(5.14) 



can be used as an approximation. The parameters k and are characteris- 
tics of the distribution, with Et being closely associated with the temperature 
T and k, describing the deviation of the distribution from a Maxwellian. For 
energies E ^ /^Ft, the distribution decays more slowly than a Maxwellian 
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and can be approximated by a power law in kinetic energy: f{E) = /o E~‘^. 
If K converges towards infinity, the distribution is a Maxwellian with temper- 
ature kT = Et- Note that for the kappa distribution, as well as for all other 
non-Maxwellian distributions, (5.10) cannot be applied. 



5.1.4 Distribution Function and Measured Quantities 

While the distribution function is important for our theoretical treatment of 
(space) plasmas, it is a quantity which cannot be measured directly. Instead, 
observations give the differential flux J{E^ r, t) of particles within a solid 
angle dl7 and an energy interval (E, E -|- dE). Thus the quantity 

J{E, 17,r,t) dAdOdt dE 

is the number of particles in the energy band from E to E + dE coming from 
the direction f2 within a solid angle dl7, going through a surface dA per- 
pendicular to df2 during the time interval dt. The differential flux therefore 
can be measured in units of particles per (m^ sr s MeV). Since J depends on 
i7, it can also be interpreted as the angular distribution of the particles. On 
a rotating spacecraft, often the omnidirectional intensity is measured. The 
latter can be obtained by averaging over all directions: 

Jomni{E,r,t) = ^ j J{E,n,r,t)dn . (5.15) 

The number density of particles with velocity in a phase space element 
is given as dn = fv^dvdf2. Multiplication by v gives the differential flux 
of particles with velocity v as f{v,p,t)v^dvdf2. Comparison with the same 
quantity expressed by the differential flux yields 

J{E^ 17, r,t)d£’di? = f{r,p,t)v^ dvdf2 . (5.16) 

Since the energy is related to speed, dE is related to dt? by dE = mvdv. The 
relation between the differential flux and the distribution function therefore 
can be written as 

J(E,17,r,t) = — /(r,p,t) . (5.17) 

5.2 Basic Equations of Kinetic Theory 

As mentioned above, the equation of motion in kinetic theory can be de- 
rived by applying the averaging scheme (5.4): the Boltzmann equation is the 
fundamental equation of motion in kinetic theory; the Vlasov equation can 
be applied if the forces are purely electromagnetic; and the Fokker-Planck 
equation also considers scattering. 
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5.2.1 The Boltzmann Equation 



The Boltzmann equation is the fundamental equation of motion in phase 
space. It is not limited to plasmas, and the only assumption inherent in the 
Boltzmann equation is that only external forces F act on the particles while 
internal forces vanish: there are no collisions between the particles. 

The Boltzmann equation is a direct consequences of the equation of con- 
tinuity in phase space: 

^ + V6-(/C)=0 or ^ + V,-(r/) + Vv-(a/) = 0, (5.18) 

where Ve, Vr, and Vv are the divergence in phase space, in ordinary space, 
and in momentum space, respectively, and v and a are velocity and accelera- 
tion. In phase space, r and v are independent variables. If we further assume 
that the acceleration a, and therefore also the force jP, is independent of v, 
(5.18) can be simplified: 

^+«-V,/ + a-Vv/ = 0 or ^ + v -Vf + - ■ ^ = 0 . (5.19) 

ot at m ov 

Equation (5.19) is called the collisionless Boltzmann equation. It also can be 
written as 

1=0, (5.20) 

which states that the convective derivative of the phase space density is al- 
ways zero for a collisionless assembly of particles. Thus for an observer moving 
with the flow, the phase space density is constant. Or, in other words: the 
substrate of points in phase space behaves like an incompressible fluid. This 
is also called Liouville’s theorem. 

The general form of the Boltzmann equation can be written as 



df 

— -\-v 
dt 







(5.21) 



where the term on the right-hand side is the rate of change in phase space 
density due to collisions (see below). 

If changes in / due to collisions are small, e.g. in the case of a thermody- 
namic equilibrium, the reduced Boltzmann equation can be written as 




(5.22) 



The solution of this equation is the Maxwell distribution. 
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5.2.2 The Vlasov Equation 



The Vlasov equation is the application of the Boltzmann equation to a plasma 
on which only electromagnetic forces act. These forces are described by the 
Lorentz force (2.23). In the derivation of (5.19) we have made the assumption 
of a force independent on t;. At first glance the Lorentz force violates this 
assumption and therefore should not be considered in (5.19). Closer inspec- 
tion, however, shows that this is not true. Since the Lorentz force contains the 
cross product of speed and magnetic field, the resulting force is perpendicular 
to the speed. Thus each individual component of the force does not depend 
on the same component of the velocity. Since in (5.18) a scalar product is 
considered, the only derivatives of a are of the form dax/dv^ and therefore 
vanish. Thus the Lorentz force can be inserted into (5.19): 



df 

dt 



V/T 



m 






V X B 



dv 



(5.23) 



Equation (5.23) is called the Vlasov equation. Because of it simplicity, this is 
the equation most commonly studied in kinetic theory. 

The Vlasov equation is derived under the assumption of non-interacting 
particles. On the other hand, interactions are the very essence of a plasma. 
Thus we have to discuss whether the Vlasov equation can be applied as often 
as it is. As we shall see, the Vlasov equation is a valid approach. It does not 
consider collisions in the sense of short-range, local interactions, such as col- 
lisions between two billiard balls or Coulomb collisions between two charged 
particles. Nonetheless, that kind of interaction, which is essential in a plasma, 
is considered: each particle moves in the average Coulomb field created by 
thousands of other particles. Thus the fields in the Vlasov equation are due 
to the rest of the plasma and describe the interaction of the particles. These 
fields often are called self-consistent fields. External fields can be included 
in (5.23), too. Since the fields E and B are determined by the rest of the 
plasma, they depend on the distribution function /. 

The Vlasov equation thus is non-linear; analytical solutions in general 
are not possible. But Jeans’ theorem identifies some solutions. It states: any 
function of the constants of motion is a solution of the Vlasov equation. For 
instance, if there are no electric fields, the kinetic energy is a constant of 
motion. Thus any function of mv‘^/2 is a solution of the Vlasov equation. In 
particular, the Maxwell distribution (5.8) is a solution. 

Jeans’ theorem therefore shows the equivalence of kinetic theory and or- 
bit theory. Following an approach given by Boyd and Sanderson [58], this 
equivalence can be shown easily. The basic equation of orbit theory is New- 
ton’s second law F = md^r/dt. This is a second-order differential equation 
in three dimensions and therefore the general solution must contain six con- 
stants of integration, 71,..., 75. Thus the solutions of Newton’s second law 
can be written as r = r( 7 i, ...,7e,t) and v — '^(71, ...,7e,t). These six scalar 
equations can be solved in principle to give the 7 ^: ji = 7 ^( 1 ;, r,t). Jeans’ 
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theorem then states that each function / = /(71, ••.,76) is a solution of the 
fundamental equation of kinetic theory, the Boltzmann equation (5.19). This 
can be seen easily by inserting the 7^ into the Boltzmann equation: 




dli , Y7 , ^ 

+ V * V7i H 

at m ov 






df d7j 
d'fi dt 



= 0 



(5.24) 



The result is identically zero because the 7, are constants. 



5.2.3 The Fokker— Planck Equation 



In contrast to the Vlasov equation, the Fokker-Planck equation considers the 
short-range, local interactions between particles. Collisions arise from many 
small Coulomb interactions between charged particles (see Sect. 5.3.2). The 
collision term has its mechanical analogy in the Brownian motion of particles 
in a gas; however, both are not equivalent as will be discussed below. 

Collisions are not a deterministic but a stochastic process. Thus for a 
given particle, although we might know its momentary position and velocity, 
we cannot determine its future motion. Only for an assembly of particles 
the collective behavior can be determined. This can be done by means of 
probabilities. Let '0(v, At;) be the probability that a particle with velocity v 
after many small collisions during a time interval dt has changed its velocity 
to t; + At;. The phase space density f{r,v^t) also is a probability function. 
At a time t it can be written as the product of the phase space density at an 
earlier time t — At multiplied by the probability of changes during this time 
interval and integrated over all possible velocity changes At;: 



f{r,v,t) = j f(r, V — At;, t - At) ^(t; - At;, At;) d(At;) 



(5.25) 



Since we only consider scattering by small angles, i.e. |At;| ^ |t;|, Taylor 
expansion to second order of the product f'lj; yields 



f(r,v,t) = J 



+ 



/ 



/(r, v,t — At) Av) — Av 
d(At;) . 



dv 



At; At; ^ d‘^{f'tp) 



d(At;) 



dvdv 



(5.26) 



Note that the © in the last term indicates a product between two tensors.^ 
The resulting matrix is the Hess matrix. 

Because some interactions always take place, the probability can be nor- 
malized to f '0d(At;) = 1. Equation (5.26) than can be simplified to 

f(r,v,t) = f(r,v,t-At) - ® ifi^vAv)) , (5.27) 



^ The product S O T of two tensors S and T itself is a tensor and can be obtained 
by application of the rules of matrix multiplication. 
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with 



(Av) = J ^Avd(Av) and (AvAv) = j 'ipAvAvd{Av) . (5.28) 

By definition, the collision term as written down in (5.21) is 
[df\ f{r, V, t) - f{r, V, t - At) 

= Ai ■ 

Thus the Fokker-Planck equation can be written as 

The first term on the right basically contains {Av)/At, which is an accelera- 
tion. Thus the term describes the frictional forces leading to an acceleration 
of the slower and a deceleration of the faster particles, which tends to equalize 
the speeds. The negative divergence in velocity space describes this narrowing 
of the distribution function. The second term, {AvAv)/At^ is a diffusion in 
velocity space. This term describes the broadening of a narrow velocity dis- 
tribution, e.g. a beam, as a result of the collisions. The two terms therefore 
operate in the opposite sense. They are in balance in an equilibrium distri- 
bution, e.g. the Maxwell distribution. The physics of the collision processes 
is contained in the probability function '0. 

Equation (5.30) also can be written as 



%) At = -Vv • (D • Vv • /) (5.31) 

with the diffusion tensor D derived from the first- and second-order fluctua- 
tions of the particle velocity. 

Figure 5.3 shows the evolution of a suprathermal distribution of particles. 
At time to the particle distribution is a parallel beam of uniform speed. Since 
the particles are much faster than the plasma, at early times {ti and ^ 2 ) pitch 
angle scattering dominates, and the distribution spreads towards larger pitch 




Fig. 5.3. Evolution of a 
suprathermal distribution of 
particles from a monoener- 
getic beam at time to towards 
an isotropic ring distribution 
at later times ts 
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angles, leading finally to an isotropic ring distribution around the origin {ts). 
With increasing time, friction and energy losses become important and the 
ring distribution contracts towards the origin. This is an example of a more 
general rule of thumb: the faster a particle moves through a plasma, the 
smaller is its frictional drag.^ This longevity property allows the existence of 
suprathermal particles as a distinct population in some cosmic plasmas. For 
instance, galactic cosmic rays or solar energetic particles both are long-lived, 
distinct particle populations in the solar wind; the radiation belt particles are 
a distinct population in the terrestrial plasmasphere. Obviously, these plas- 
mas can not be described by a Maxwellian. Instead, the kappa distribution 
gives a quite reasonable description. 



5.3 Collisions 

We have mentioned collisions twice in this chapter. In Sect. 5.1.2 we intro- 
duced the Maxwell distribution. The basic requirement for such a distribution 
is thermal equilibrium, which requires collisions between the particles. If we 
have a distribution with a suprathermal tail, such as the kappa distribution, 
in time collisions will transform it into a Maxwellian. This time scale de- 
pends, of course, on the time scales of the collisions. We have also mentioned 
collisions in connection with the Fokker-Planck equation. We have even men- 
tioned that the collisions should lead to small changes in speed only. But we 
did not talk explicitly about the nature of these collisions. This section is 
supplementary, briefly introducing some of the basics of collisions. 

Collisions are also important in the energy transfer between different com- 
ponents in a plasma: imagine a plasma which also contains neutral particles. 
The charged particles might be accelerated by an electric field. In time, col- 
lisions between charged and neutral particles will equalize the two distribu- 
tions, leading to an acceleration of the neutrals. 

As in a neutral gas, collisions in a plasma change the path of the individual 
particle. In a magnetized plasma, collisions between a charged particle and 
a neutral can shift the gyro-center of a particle onto another field line (see 
Fig. 5.4). Collisions between charged particles can also lead to a shift in the 
gyro-center and/or changes in pitch angle. 

5.3.1 Collisions Between Neutrals 

Collisions between neutral particles give rise to the Brownian motion in a 
gas. The individual process is a collision between two hard spheres. The hard 

^ Graphically this can be understood from the fact that with increasing particle 
speed the time available for interaction between the energetic particle and a 
particle of the background plasma decreases; a more formal explanation is given 
in Sect. 5.4. 
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Fig. 5.4. Collisions in a plasma can shift the 
gyro-center of a particle onto another field line, 
here illustrated for a collision of a charged par- 
ticle with a neutral one 



sphere model is a simple and quite useful approximation. The full treatment 
of the collisions between neutrals requires a quantum-mechanical approach 
considering the attracting van der Waals forces and the repulsing Coulomb 
forces of the electron shells of the two atoms. The van der Waals potential 
varies roughly with the repulsing potential with [569]. Combination 
of both leads to an extremely steep potential surrounded by a very shallow 
potential depression in the order of meV compared with the typical eV range 
in molecule formation. A hard sphere therefore is a reasonable approach to 
describe the collision of neutrals. 

The basic equations are the conservation of momentum and the conser- 
vation of energy. The changes in momentum and direction depend on the 
masses and speeds of the particles and on the angle between their veloci- 
ties. The change in momentum is largest in a head-on collision: when mass is 
equal, the particle loses twice its initial momentum as its velocity is reversed. 
Thus scattering by a large angle up to 180° is possible in collisions between 
neutrals. 

The relevant parameters to describe the scattering process are the mean 
free path and the scattering cross section. The particle mean free path A is 
defined as the average distance travelled by a particle between two subsequent 
collisions. If we could follow a smoke particle in air, we would detect a path 
similar to the one depicted on the left-hand side of Fig. 5.5. The statistical 
motion is composed of many straight lines with different length L. The right- 
hand side of Fig. 5.5 shows the distribution of these L. This probability 
distribution can be described by a function p{L) = a exp(— L/A) with a 




Fig. 5.5. Statistical 
path of a particle un- 
der the influence of 
collisions (left) with 
other particles (Brow- 
nian motion) and dis- 
tribution of the path 
length between two 
collisions (right) 
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being a constant depending on the total number of collisions and A being the 
particle mean free path. 

The number of collisions, and therefore the mean free path, depends on the 
number density of particles and on their ‘size’ as described by the scattering 
cross section. Consider a fast particle with radius ri moving in a gas of slow 
particles with radii r 2 - A collision happens if the distance between the two 
particles has decreased below ri + r 2 . Alternatively, we can assume the fast 
particle to be a mass point. Then we have to attribute a radius of ri + V 2 to 
the gas molecules. Thus for the fast particle, a gas molecule is equivalent to 
a disk with the scattering cross section cr == 7r(ri + r 2 )^. 

Now consider a beam of particles incident on a slab of area A and thick- 
ness dx. The number density of molecules in this slab is rig, the total number 
of molecules in the slab is risAdx. The fraction of the slab blocked by atoms 
therefore is aUsAdx/A = arisdx. Out of N particles incident on the slab, 
AN = Nriscrdx will experience a collision, leading to a reduction in N ac- 
cording to dN/N = —crris dx. Integration yields 

N{x) — Nq exp(— crngx) == Nq exp(— a:/A) , (5.32) 

where the mean free path A is defined as 

A = — . (5.33) 

TlsCT 

Thus, the mean free path can also be interpreted as the distance over which 
the number of particles decreases to 1/e of its initial value. After travelling a 
distance A, the particle will have a high probability of colliding. The average 
time between two collisions is 

-A - ^ 

{v) riscr{v) 

This also can be written as a collision frequency Uc. 

Uc = riscr(u) . (5.35) 

The formalism for interactions between a charged particle and a neutral 
is the same as for a collision between two neutral particles. 

5.3.2 Collisions Between Charged Particles 

Collisions between charged particles do not require a direct contact, instead 
the interaction takes place as each particle is defiected in the electric field of 
the other one. Since the Coulomb force has a long range such an interaction 
leads to a gradual defiection. Nonetheless, one can derive a kind of cross 
section for this process. Following the attempt given in Chen [97], we shall 
only make an order-of-magnitude estimate. 



(5.34) 
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Fig. 5.6. Coulomb scatter- 
ing: orbit of an electron in the 
Coulomb field of an ion 



The geometry of a Coulomb collision is shown in Fig. 5.6: an electron 
with velocity v approaches an ion with charge e. If no Coulomb force acts, 
the electron will pass the ion at a distance tq, the impact parameter. But 
the Coulomb force F = — e^/(47rr^) leads to a deflection of the electron from 
its original direction by an angle The force acts on the electron for a 
time T ^ To/ V when it is in the vicinity of the ion. The change in electron 
momentum then can be approximated by Ap = \FT\ ^ e'^ / {47rrov) . For 
a 90° collision, the change in momentum is of the order of mv. Thus it is 
Ap = mv = e^/{4'Krov). A deflection by 90° results for an impact parameter 
rgQo = e^/(47rmu^). The cross section for a deflection of at least 90° therefore 
can be written as 



(^> 90 ° = = 






(5.36) 



leading to a collision frequency of 



^ei,>90° — — 



167rm‘^v^ 



(5.37) 



In a real plasma the situation is more complex. Let us consider the motion 
of one particle, a test particle, in the held created by the other particles, the 
held particles. The flelds of these particles add to a stochastic held that 
changes continuously in time and space. Therefore, the test particle will not 
move in a hyperbolic orbit as in the interaction between two charged particles, 
instead it basically follows its original direction of motion, though not on a 
straight line but on a jittery trajectory. Because of the stochastic nature of 
the collisions, test particles with nearly identical start conditions will diverge 
in space and velocity. Most of these collisions result in small changes in the 
particle path only. Occasionally, also large deviations of the original direction 
result. These are called large- angle collisions. 

To understand the different types of collisions, we have to consider the typ- 
ical spatial scales. One characteristic scale is the Debye length Ad (3.146): the 
test particle is screened from the electric held of the charges outside the De- 
bye sphere. The Debye length can be interpreted as the range of microscopic 
electric flelds and separates the held particles into two groups: (i) particles 
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at distances larger than the Debye length can influence the test particle by 
the macroscopic flelds only, leading to gyration, drift, and oscillations; and 
(ii) particles inside the Debye sphere create a microscopic field leading to the 
stochastic motion of the test particle. 

The other spatial scale is related to the scattering angle, which in turn 
is related to the impact parameter. In the derivation of the Fokker-Planck 
equation we have assumed scattering by small angles only. The deflection 
angle will be small if the kinetic energy m^v\j2 of the test particle is large 
compared with the electrostatic potential where ro is the impact 

parameter (see Fig. 5.6). The test particle will be deflected by a small angle 
only if the impact parameter ro fulfills the condition r^Qo < ro < Ap), with 
rgoo being the impact parameter for a deflection by 90° defined as rgoo == 
ZxZFe^/(mT't’x)- ratio for deflections by small and large angles can be 
determined from the ratio of cross sections for both processes: 




The expression inside the second parentheses is the number of particles in- 
side a Debye sphere. If we assume this number to be large, (5.38) states 
that collisions leading to deflections by a small angle by far outnumber the 
collisions with large-angle deflection. A careful calculation shows that small- 
angle interactions are about two orders of magnitude more efficient in the 
deflection of test particles than the few large-angle interactions [42]. Thus, 
the Fokker-Planck formalism can be applied to the Coulomb collisions in a 
plasma too. The logarithm of the above quantity, Ac = In A, is called the 
Coulomb logarithm. 



5.4 Collisions Between Charged Particles: Formally 

We will now have a closer look at the formal treatment of collisions, following 
[281], 



5.4.1 Coulomb Collisions: Unscreened Potential 

Again we assume a test particle (this time not necessarily an electron) with 
speed vx, mass mx, and charge approaching a stationary background 
particle of charge and mass rriB- The geometry is as in Fig. 5.6, and the 
impact parameter is rg. The potential energy of the test particle is Wc,x = 
gx^c? where is the undisturbed potential of the background charge. Then 
we can also write for the potential energy 

Txr Of qtQb 

kFc T = — 5 where a = . 

’ r inso 



(5.39) 
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The angle of deflection is given by 

(y? ?^Qn° 1 0^1 

tan ^ = , where rgoo = ^ (5.40) 

2 r rnTt^x 

is the impact parameter for a deflection by 90°. The minimum distance dmin 
of the particle from the background particle and its speed at this position 
can be derived from the conservation of angular momentum, 



rriTVTro = mT^minCJmin , 



(5.41) 



and the conservation of energy, 



TUtV^ 

2 
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min 
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+ Wc,T(dmin) , 



(5.42) 



to be 



a -h + m^rQV^ 
rriTV^ 



(5.43) 



Depending on the charge of the background particle, a can be positive or 
negative, and we obtain the following for the minimum distance in the 
case of a deflection by 90°: 



^90° ^ f (V2 + l)rgoo for a > 0 
"min |(y 2 -l)r 90 o fora<0 



(5.44) 



The corresponding potential energies are 



VFc,t 




rriTV^/{V2 + 1) 
rriTV^/ly/2 — 1 ) 



for a > 0 
for q; < 0 



(5.45) 



If we assume that the potential energy is equal to the thermal energy Wth, 
the minimum distance between charged particles in a plasma is 



_ QtQb 
47T£oWth * 



In an ideal plasma, this distance is 

dmin < < Ad 



(5.46) 



(5.47) 



The cross section a deflnes a circle with the impact parameter vq as its 
radius. The differential cross section da is related to a deflection by an angle 
dip or to a deflection into a solid angle di? = Itt sin (p dp. With (5.40), we then 
obtain the following for the differential cross section for Coulomb scattering: 



da 

df? 



a 



2rriTV^ sim((y9/2) 



2 



(5.48) 
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Note that the differential cross section does not depend on the sign of the 
charges: the cross section is the same if both have the same sign or the 
opposite sign. Only the hyperbolic path of the particles is different, not the 
deflection angle (/?. 

For a large impact parameter tq, the deflection angle diverges as In 
addition, the total cross section for interaction tends towards infinity: 



<^C,unscr 






(5.49) 



This infinite cross section is reasonable for a single charge that deflects an- 
other one, in a plasma, however, the screening of the Coulomb potential has 
to be taken into account. 



5.4.2 Decelerating Force Between Particles (Drag) 



In addition, the significance of an infinite cross section is not clear: although 
even at large impact parameters deflection occurs, this might be of minor 
importance, since the deflection angle is small. Thus we have to find a method 
to describe the impact of background particles on a beam of test particles. 

For a first approximation, we assume only one background particle, with 
infinite mass and at rest. The particle beam is described by its number density 
nx and the particle parameters of velocity ux, mass mx, and charge q^. 
Owing to symmetry, momentum is transported only in the direction of Vx- 
During elastic scattering at a fixed obstacle, the magnitude of the velocity is 
conserved, and we obtain the following for the change in velocity ^Vx- 



|5t;x| 2i;x sin{(p/2) . 
In addition, we have (see Fig. 5.7) 

|(^Vt,|i| = 2ux sin((^/2) cos((/?/2) , 
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Fig. 5.7. Changes in velocity 
during a Coulomb collision 
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The number of particles streaming through a ring with area dcr during 
a time interval dt is dA^x = titVt da d^. The decelerating force Ft,^ from 
the background particle acting on the test particles can now be obtained by 
integrating over all rings den 



F, 






mx 




dATx , 

I — r da 
da dt 



(5.55) 



Using the differential cross section (5.48), this can be rewritten as 
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(5.56) 

This quantity is logarithmically divergent for 0: the drag force due to 

a background particle in an unscreened Coulomb potential diverges towards 
infinity. 



5.4.3 Coulomb Collisions: Screened Potential 



We have already introduced the Debye length in Sect. 3.7 as a parameter 
that basically describes over what distance a certain charge influences other 
charges. Using the Debye length, a screened potential or reduced potential 
can be introduced, which is related to the unscreened Coulomb potential 
by 

^d(^) = . (5.57) 

Although the screened potential decays faster than the Coulomb potential, it 
still extends to infinity. To describe the influence of the screened potential we 
make the following simplification: for scattering with an impact parameter 
^0 < Ad, the potential is assumed to be ^c, while for ro > Ad, the potential 
vanishes. In this case, the smallest deflection angle is given by 




Ad 



(5.58) 



In an ideal plasma, we therefore have 



In 




In ^90° / Ap 

^90° 



- In — - In A , 
^90° 



(5.59) 



where In A is the Coulomb logarithm. 

Equation (5.59) can also be derived from (5.56) by choosing the upper 
integration boundary as tt/ 2 instead of tt: in this case scattering is limited to 
deflection angles less than or equal to 90°. 
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We then obtain the following for the drag force: 



t,z 
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( 5 . 60 ) 



where 
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In A and In yl = In 
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( 5 . 61 ) 



4ne^ ““ “““ “ |9t«b| 

We have already mentioned that the drag force caused by the background 
plasma decreases with increasing particle speed. In this case we have a back- 
ground plasma of particle density ub containing particles with velocity vb 
and mass me- A test particle then experiences a drag force that is determined 
by the reduced mass of the particles 



mTB 



mx^B 
rriT + rriB 



and their relative velocity vr = -ux — '^^b- 
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A useful approximation to this equation is 

1 






mxB^R 



( 5 . 62 ) 



( 5 . 63 ) 



( 5 . 64 ) 



The basic result is, as stated before, that the drag force exerted by the back- 
ground plasma decreases with increasing particle speed. In addition, the drag 
force is larger for electron-ion collisions (mxB ^ “^e) than for particles of 
equal mass (mxB = m/2). 



5.5 Summary 

Kinetic theory describes a plasma as an assembly of particles with statisti- 
cally distributed properties. The basic quantity is the phase space density. In 
thermal equilibrium it is described by the Maxwell distribution. The equa- 
tion of continuity for the phase space density allows the derivation of the 
basic equations of kinetic theory: the Boltzmann equation in its most general 
form or as collisionless Boltzmann equation. For a pure electromagnetic field, 
the collisionless Boltzmann equation becomes the Vlasov equation. If colli- 
sions lead to small-angle defiections only, the collision term in the Boltzmann 
equation can be described by the Fokker-Planck equation. Despite the lim- 
itation to small-angle collisions, the Fokker-Planck equation can be applied 
to a plasma: while Coulomb collisions can lead to large-angle defiection, these 
are outnumbered by the small-angle defiections. 
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Exercises and Problems 



5.1. Describe the meaning of the mean free path. What are the physical and 
formal differences in a neutral gas and in a plasma? 

5.2. The solar wind is a proton gas with a temperature of about 1 million K. 
Plot the distribution function and determine the most probable speed and 
energy. Compare with the flow speed of 400 km/s and the kinetic energy 
contained in the flow. 

5.3. A spacecraft measures the proton distribution in the solar wind. Above 
an energy of about 20 keV, the distribution can be described as a power law 
in E with E~^. Plot the distribution and compare with the results of Problem 
5.2. What kind of distribution is this? 




Part II 



Space Plasmas 




6 Sun and Solar Wind: 
Plasmas in the Heliosphere 



... but it is reasonable to hope that in not too distant a 
future we shall be competent to understand so simple a 

thing as a star. 

A.S. Eddington, The Internal Constitution of the Stars^ 



Plasmas in interplanetary space originate from the Sun, as do most of the 
disturbances and waves embedded in them. The solar atmosphere, the corona, 
extends as solar wind far beyond the orbit of the outermost planet, Pluto, 
filling a cavity in the interstellar medium called the heliosphere. The solar 
magnetic field, frozen-in into the solar wind, is carried out and wound up 
to Archimedian spirals by the Sun’s rotation. Fluctuations and waves on 
different scales are superimposed, sometimes steepening to collisionless shock 
waves. The solar wind and the frozen-in magnetic field change during the solar 
cycle due to systematic changes in solar properties and transient disturbances 
related to solar activity. Detailed accounts on solar physics and the physics 
of the interplanetary medium are given in e.g. [113,484,572], and the solar 
corona and the physics of solar activity are described in e.g. [9,193,244,291, 
410,420,424]. 



6.1 The Sun 

For an astrophysicist, the Sun is an ordinary star of spectral class 2, also 
called yellow dwarf, and luminosity V. It consists mainly of hydrogen (about 
92% in terms of particle number or 72% in terms of mass) as the fuel for so- 
lar energy production and helium (about 8%), partly primordial and partly 
waste product of the energy generation. But the Sun is also more interesting 
than other stars: owing to its close proximity, we are able to study not only 
its electromagnetic radiation but also solar emissions of a different kind - 
plasmas and energetic particles. These not only can be studied for the quiet 
or average Sun but also for their dependence on the solar cycle, their re- 
lation to certain features on the Sun, such as sunspots and filaments, and 

^ Copyright 1926, reprinted with kind permission from Cambridge University 
Press. 




136 



6 Sun and Solar Wind 



Table 6.1. Properties of the Sun 



Radius 


tq = 696 000 km 


Mass 


M© = 1.99 X 10®° kg 


Average density 


qq = 1.91 g/cm® 


Gravity at the surface 


3© = 274 m/s^ 


Escape velocity at the surface 


Vesc = 618 km/s 


Luminosity 


L© = 3.86 X 10®® kW 


Magnetic field 


polar 


1 G 


general 


some G 


protuberance 


10-100 G 


sunspot 


3 000 G 


Temperature 


core 


15 million K 


photosphere 


5780 K 


sunspot (typical) 


4200 K 


chromosphere 


4400-10 000 K 


transition region 


10 000-800 000 K 


corona 


2 million K 


Sidereal rotation 


equator 


26.8 d 


30° latitude 


28.2 d 


60° latitude 


30.8 d 


75° latitude 


31.8 d 



their association with the violent processes of the active Sun. Many of these 
emissions are of interest to the layman, too, because they shape and influence 
our environment, from the atmosphere and the weather down to the realms 
of biology and physiology (Chap. 10). The basic properties of the Sun are 
summarized in Table 6.1. 

Most of the Sun’s emission is electromagnetic radiation, amounting to 
3.86 X 10^^ kW integrated over its surface or 6.3 x 10^ kW/m^. With Stefan- 
Boltzmann’s Law an effective temperature Teff of about 5780 K results. At 
Earth’s orbit, the solar constant, which is the solar power received per unit 
area, is 1380 W/m^. The solar radiation can be divided into flve frequency 
bands: (a) X-rays and extreme ultra-violet (EUV) with A < 180 nm con- 
tributes to about 10~^ of the total energy output. It is emitted from the 
lower corona and the chromosphere, and varies during the solar cycle with 
enhancements up to orders of magnitude during solar flares, (b) Ultra-violet 
with wavelength between 180 and 350 nm contributes to about 9% of the 
solar flux. It is radiated from the photosphere and the corona, its variations 
are similar to the ones in X-rays, although they are smaller, (c) The visible 
light between 350 and 740 nm contributes to 40% of the energy flux and does 
not vary significantly with the solar cycle. Only in extremely strong flares a 
local brightening can be observed, (d) The maximum energy flux of 51% is 
in the infrared between 740 nm and 10^ nm, showing no significant variation 
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with solar activity. As the visible light, it is emitted from the photosphere, (e) 
Radio-emission above 1 mm, originating from the solar corona, contributes 
to only 10“^°% of the solar energy flux, but can be enhanced significantly 
during solar flares. 

6.1.1 Nuclear Fusion 

The source of the Sun’s energy is nuclear fusion. This idea goes back to 
Eddington in 1926 [143] and replaced Lane’s concept of 1869 which saw the 
energy source in the Sun’s gravitational contraction. 

Inside the Sun the basic process of nuclear fusion is the proton-proton 
cycle: four protons merge to one ^He-nuclei (a-particle). In the first step, two 
protons merge to a deuteron ^H, emitting a positron and a neutrino. In the 
second step, a proton collides and merges with the deuteron, forming a ^He- 
nuclei under emission of a y-quant. When two ^He-nuclei collide, they merge 
to an Qf-particle, emitting two protons and a y-quant. The mass difference 
between the four protons and the a-particle corresponds to an energy of 
4.3 X 10~^^ J or 26.2 MeV. Formally, the reaction can be written as 

^H(p, e+z/e)^D(p, 7 )^He(^He, 2p7)‘‘He + 26.2 MeV . (PPI) 

Under solar conditions, half of the hydrogen initially present is converted 
into deuteron within 10^^ years: for two protons to merge, their distance 
must decrease below a proton radius and one of the protons has to undergo 
spontaneous /^-emission. The life-time of is only a few seconds, it immedi- 
ately captures another proton. The time scale for the fusion of the resulting 
^He-nuclei is about 10® years. Thus the time scale of the proton-proton cycle 
basically is determined by the first step, the fusion of two protons. 

The last step in the PPI cycle can be replaced by one of the two reactions: 

^He(a, 7 )^Be(e" , i/e)^Li(p, 7 )®Be(a)^He + 25.9MeV (PPII) 
or 

^He(a, y)^Be(p,y)^B(e+z/e)^*Be(a)^He -h 19.5MeV . (PPIII) 

In both cases the ^He-nucleus merges with an a-particle. In the PPII chain, 
the resulting ^Be is converted into ^Li by electron capture. The ^Li then is 
converted by proton capture into the unstable isotope ®Be which decays into 
two a-particles. In the PPIII chain, the ^Be-nucleus captures a proton. The 
resulting ®B-nucleus emits a neutrino and a positron, leading to an excited 
®Be-nucleus which decays into two a-particles. With increasing temperature, 
the latter two reactions become more important compared with the PPI cycle. 

Independent of the details of the reaction, energy is liberated in the form 
of electromagnetic radiation, positrons, neutrinos and, to a smaller extent, 
kinetic energy of protons. The energy of ys, positrons, and protons imme- 
diately is converted into thermal energy while the neutrinos escape: their 
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scattering mean free path is about 7000 AU, compared with the few cm of 
a photon. Thus although a photon travels with the speed of light, it needs 
about 100 000 years to diffuse from the Sun’s core to its surface. 

6.1.2 Solar Neutrinos 

Neutrinos emitted in the various chains of the proton-proton cycle have char- 
acteristic energies. Thus their energy spectrum gives information about the 
processes inside the Sun, allowing a test of our standard model of the Sun. 
Neutrinos are one of the current problems in solar astrophysics. Compared 
with the first observations, a larger number of solar neutrinos is being de- 
tected today; however, it still is smaller than expected. 

Part of the problem of detecting neutrinos and of interpreting the results 
arises from the measurement principle used: different techniques are sensitive 
to different neutrino energies and thus also to different parts of the energy 
production cycle. The first neutrino detector was a tank of 615 ton liquid 
perchloroethylene located about 1500 m under the Homestake mine [120]. 
This sensitivity of this instrument is such that the neutrinos are expected 
to generate a signal of 5.6 SNU^ with ^Be neutrinos contributing 1.1 SNU. 
Over the years, the Homestake experiment has measured an average of 2.56 ± 
0.23 SNU, compared with 7.6 ±1.2 SNU expected from the standard solar 
model (SSM). This discrepancy is well known as the solar neutrino problem 
[103], 

The Japanese experiments Kamiokande [175] and Superkamiokande [176], 
a 680 ton water tank 1 km underground in the Kamioka mine, have an even 
higher energy threshold and are only sensitive to the high-energy end of the 
®B neutrinos; see Fig. 6.1. These experiments show a neutrino deficit of about 
50%. 

Gallium experiments, such as SAGE, GALLEX, and GNO, have a much 
lower energy threshold and thus are able to detect also the pp neutrinos. 
These experiments give a neutrino flux of 74.7 ± 5.0 SNU, lower than the 
128 ± 8 SNU expected from the SSM [211]. 

Two possible explanations have been offered: either our model of the 
Sun or our understanding of the neutrino is wrong [19,21,218,293,366,368]. 
Fine-tuning of the solar standard model to fit the neutrino observations ap- 
pears possible without changing the energy flux at the solar surface, although 
often variations in one parameter lead to a better agreement between the 
predictions and one of the experiments while the predictions for the other 
experiments are still in disagreement with the observations. For instance, 
a reduction in the core temperature in the SSM would allow one to fit the 
Super-Kamiokande observations, while there still would be a discrepancy with 
the Homestake observations. 

^ Solar neutrino unit; 1 SNU, equals 10“^® captures per target atom per second. 
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Fig. 6.1. Energy spec- 
tra of neutrinos emitted 
in the various nuclear 
reactions inside the sun. 
The threshold energies 
of various experiments 
are indicated by verti- 
cal lines. Figure from 
S.M. Chitre [100] based 
on [20], in Lectures on 
solar physics (eds. H.M. 
Antia, A. Bhatnagar, P. 
Ulmschneider), Copy- 
right 2003, Springer- 
Verlag 



The other explanation is of great importance to elementary-particle 
physics: it suggests that neutrinos, contrary to our current understanding, 
have a mass and can oscillate between diflFerent flavors. It has been proposed 
that the electron neutrinos created in the proton-proton chains are partly 
converted into r- or /x-neutrinos during their travel time from the Sun to 
the Earth. Since only the electron neutrinos can be detected by the above 
experiments, a lower neutrino flux than would otherwise be expected results. 

A more recent experiment, the Sudbury Neutrino Observatory (SNO), 
suggests that indeed solar neutrinos change their flavor during their journey 
from the Sun to the Earth [2]. SNO consists of 1000 tons of heavy water (D 2 O) 
at a depth of over 6000 m water equivalent. Owing to the use of deuterium 
instead of ordinary hydrogen, not only can electron neutrinos be detected but 
it is also possible to measure the total neutrino flux. Thus electron neutrinos 
can be distinguished from other neutrino flavors. The results of SNO provide 
evidence that some electron neutrinos are converted during their travel time 
to a different neutrino flavor [3] at a rate which gives corrected fluxes in 
agreement with the SSM. In addition, the KamLAND experiment [147] has 
detected oscillations in antineutrinos produced in nuclear reactors. 

In sum, these results increase our confldence in the SSM because one of 
the major objections against it, the solar neutrino problem, appears to have 
its origin in elementary-particle physics rather than in solar physics. 



6.1.3 Structure of the Sun 

Nuclear fusion takes place in the Sun’s core, which is about 0.3 r© in ra- 
dius (see Fig. 6.2). It is surrounded by the radiative core or radiation zone, 
where energy is transported by radiation. In the surrounding convection zone 
energy is transported by convection. The top of the convection zone is the 
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Fig. 6.2. Internal structure of the 
Sun with typical densities and tem- 
peratures 



photosphere, the visible surface of the Sun. Here most of the visible light is 
emitted. The convection cells can be seen as granulation of the photosphere. 
From the core to the photosphere, the density decreases by more than ten 
orders of magnitude, and the temperature decreases by a factor of 3000. 

The photosphere is optically too thick to receive any electromagnetic ra- 
diation emitted from deeper layers of the Sun. Thus solar neutrinos are the 
only messengers escaping directly from the core of the Sun. All other informa- 
tion about the internal structure of the Sun is obtained indirectly. The most 
important tool is helioseismology; for a recent review see [7]. Seismology on 
the Earth is mainly concerned with sudden bursts of mechanical waves during 
either earthquakes or explosions initiated by humans to probe the terrestrial 
interior. The goal of helioseismology is similar: to use mechanical vibrations 
observed on the solar surface to obtain information about the solar interior. 
But helioseismology is different from seismology on the Earth because oscilla- 
tions are always present on the Sun. They can be identified from the Doppler 
shift of spectral lines. These mechanical vibrations of the solar surface are 
centered around a period of about 5 min [318]. These oscillations have been 
identified as a superposition of millions of standing waves with amplitudes of 
the order of a few meters and speeds of the order of a few cm/s [317,529]. 

Detailed analysis of these observations allows us to infer parameters such 
as the sound speed, density, temperature, and chemical composition in the 
solar interior [10,203,294]. This information is used to confirm and refine the 
standard solar model. 

From the viewpoint of plasma physics, a second set of results of helio- 
seismology is more important: it allows us to infer the rotation rates in the 
solar interior. This is of particular importance for the understanding of the 
solar dynamo process. The main results are the following: (a) The surface 
differential rotation persists through the convection zone, while the radia- 
tive transfer zone appears to rotate relatively uniformly [465,516]. However, 
there appears to be a shear layer beneath the solar surface extending down to 
about 0.94 solar radii, (b) The transition region, also called the tachocline, is 
located near the base of the convection zone [33] . The tachocline also seems 
to be the seat of the solar dynamo. Both features are summarized in Fig. 6.3. 
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Fig. 6.3. Contours of constant solar 
rotation rate inferred from helioseis- 
mology; figure taken from H.M. An- 
tia [7], in Lectures on solar physics 
(eds. H.M. Antia, A. Bhatnagar, and 
P. Ulmschneider), Copyright 2003, 
Springer- Verlag 



(c) There is a large-scale flow in the north-south direction, called the merid- 
ional flow [217]. The flow speed is of the order of 20 m/s, that is, about two 
orders of magnitude smaller than the rotation speed. This meridional flow is 
believed to play an important role in the solar dynamo process [373]. 

With the good coverage of helioseismological observations since the mid- 
1990s, an analysis of temporal variations has become possible. So far, no 
significant temporal variations have been detected in the internal structure 
of the Sun. But there are some solar-cycle-related modifications close to the 
surface. For instance, the 5-min oscillations are shifted by up to 0.4 jiiHz with 
higher frequencies during solar maximum [237], probably because of some 
solar-cycle-related disturbances in the outer layers of the Sun. 

In addition, the rotation rate varies with time: on the surface there ex- 
ist zonal bands of slow and fast rotation which migrate slowly from high to 
low latitudes during the solar cycle. These flow bands are correlated with 
migrating magnetic-activity bands already known from the butterfly dia- 
grams [235,495]. Helioseismology has refined this picture: the bands move 
towards the equator at low latitudes, while at latitudes above 50° they move 
polewards [8]. This corresponds to the migration seen in magnetic patterns 
(Fig. 6.28), and thus the poleward migration might be crucial in the under- 
standing of the solar dynamo and, in particular, the polarity reversal [109]. 

One of the most recent instruments used to study solar oscillations is the 
MDI (Michelson Doppler Imager) on board SOHO. Details about the instru- 
ment, a movie showing a solar quake, and many results from this instrument 
can be found at sohowww.nascom.nasa.gov/gallery/MDI/. 



6.1.4 The Solar Atmosphere 

Above the photosphere the solar atmosphere consists of three layers: the 
chromosphere, the transition region, and the corona. The corona can be seen 
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Fig. 6.4. (Left) Coronal structure during the total eclipse of 11 July 1991. Based 
on a sketch by S. Koutchmy in K. Lang [308], Sun, earth and sky, Copyright 1995, 
Springer Verlag; (Right) image of the total eclipse on 16 February 1980 (source: 
www.hao.ucar.edu) 



in visible light during a solar eclipse as a structured, irregular ring of rays 
around the solar disk. Its structure and extent vary with the solar cycle. 
Figure 6.4 shows on the left-hand side a sketch of the coronal structure during 
the total eclipse of 11 July 1991 and on the right-hand side a photo taken 
during the eclipse of 16 February 1980. Since both have been taken during the 
solar maximum, the corona is highly structured and extends far outwards. 
During the solar minimum, only few structures are visible and the corona 
appears smaller. However, the corona does not have a sharp outer boundary, 
but instead shows structures which extend into different heights and then 
fade into the background. 

The charge states of heavier elements such as O, Si, Mg, and Fe indicate 
coronal temperatures of about 1 million K. Nonetheless, the corona does not 
radiate like a black body because it is too thin. Its temperature is roughly 
independent of height, but it is about a factor of 200 higher than the photo- 
spheric and chromospheric temperature. The fastest increase in temperature 
from about 25 000 to 500 000 K occurs in the transition region, which is only 
a few hundred kilometers thick and separates the corona from the chromo- 
sphere (chromos = color) below. The latter has a height of about 2000 km 
and can be seen during a total eclipse as a thin red ring around the solar disk, 
giving the appearance of small flames. The chromospheric emission is weak 
compared with the photospheric emission because the density of the chromo- 
sphere of about 10“^^ g/cm^ is about five orders of magnitude smaller than 
the photospheric density. However, owing to the higher temperature, the max- 
imum of the chromospheric emission is in the UV. Here the chromospheric 
emission exceeds the photospheric emission: a UV instrument thus “sees” the 
chromosphere but does not look down to the photosphere. Depending on the 
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wavelength under consideration, different layers of the atmosphere are seen by 
a telescope. The first observations at different wavelengths in the UV by OSO- 
4 in 1967 showed particularly well the appearance of the polar coronal hole 
at larger heights. An example of a modern instrument is EIT (Extreme ultra- 
violet Imaging Telescope) on board SOHO; a description of the instrument, 
results, and a picture gallery can be found at umbra.nascom.nasa.gov/eit/. 



Excursion 5. Thermal emission Chromospheric emission is thermal emis- 
sion, as is the photospheric emission. Some details of this emission can be 
approximated under the assumption that the Sun is a black body. In this 
case the spectrum of the emitted radiation can be described by Planck’s law: 
the energy per unit interval of wavelength emitted by a unit surface area of 
a black body into a unit solid angle is given by 



Bx{T) = 



BAT) = 



2hA 1 



2hA 1 



c2 exp(|^)-r 



( 6 . 1 ) 



The total radiation emitted by the black body can be obtained by integration 
over all wavelengths: 



oo 

^ = 7tF = 7T j Bx{T) dA = aT^ , (6.2) 

0 

where a = 8.26 x 10“^^ cal cm“^ min“^ K~^ = 5.6708 x 10“^ J m“^ K~^. 

This is the Stefan-Boltzmann law. We have already encountered this law 
when talking about the temperature of the photosphere. The wavelength of 
the maximum of Planck’s curve can be obtained by setting the first derivative 
of (6.1) equal to zero. We then obtain Wien’s law, 

A(max) T = const — 2884 pmK . (6.3) 

This can be used to determine the wavelength of the maximum of the emission 
of a black body. 

For instance, from (6.3), the photospheric emission (5780 K) has its max- 
imum at 500 nm, well inside the visible. The wavelengths of maximum emis- 
sion at the bottom (25 000 K) and top (500 000 K) of the transition region 
are 115 nm, which is in the UV, and 6 nm, which is in soft X-rays, respec- 
tively. Thus looking at the Sun in different frequency ranges means looking at 
different layers of the atmosphere. The soft X-ray emission, since it is viewing 
greater heights, reveals the coronal holes as dark patches particularly well, but 
also shows the active regions as bright spots. An example is shown in Fig. 6.5, 
which shows the evolution of the chromosphere and lower corona during one 
solar cycle, starting at the maximum in 1990 on the left and continuing to the 
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Fig. 6.5. Variation of the Sun in soft X-rays during the solar cycle as ob- 
served by Yohkoh; source: solar.physics.montana.edu/mckenzie/Images/The_ 
Solar _Cycle_XRay_hi .jpg 



maximum in 1999 on the right. A nice movie showing the relation between 
wavelength and height can be found at sohowww.nascom.nasa.gov/ as “Five 
same-day images of Sun in different wavelengths” . □ 

The coronal emission consists of three components, the emission line or 
E-corona, the continuum or K-corona (K for the German word Kontinuum)^ 
and the Fraunhofer or F-corona. The E-corona was first observed during the 
1868 solar eclipse. But only in the 1940s did scientists understand the sources 
of the spectral lines, since these lines were not known from laboratory exper- 
iments, because they required extremely high ionization states; for instance, 
the 530.3 nm green line is from Fe XIV, the 637.4 nm red line is from Fe X, 
and the 569.4 nm yellow line is from Ca XV. These charge states indicate 
temperatures of more than one million K. These high temperatures also allow 
us to understand the great height of the corona. The photospheric tempera- 
ture of 5780 K would lead to a scale height of about 150 km. At a distance 
of one solar radius above the photosphere, the density would have dropped 
by a factor of exp(— 696 000/150), which is almost zero. With a coronal tem- 
perature of two million degrees, a scale height of 10^ km results, allowing the 
large extent of the corona. 

The main visible coronal emission, the K-corona, is not a real emission but 
is photospheric light scattered from coronal electrons. Therefore the coronal 
electron density can be inferred from the intensity of the K-corona. Thus 
Fig. 6.4 also can be interpreted as an electron density distribution. The K- 
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corona is linearly polarized because the electrons are aligned in the coronal 
magnetic field. 

Typical coronal features include bright arcades, which can be interpreted 
as closed magnetic loops where electrons are stored, and helmet streamers, 
which are arcades from which a thin beam extends upwards. The ray-like 
structures suggest open field lines with electrons streaming away from the 
Sun. The rather dark regions are depleted of electrons. These coronal holes 
can also be seen as dark patches in soft X-ray images and are the source 
regions of the fast solar wind. 

Depending on the underlying structures, the electron density at any 
given height can vary by more than three orders of magnitude [297]. In 
the lower corona, the intensity gradient is very steep, with a scale height 
of about O.lrQ. Thus, even with an extremely good radiometric resolu- 
tion, it is not possible to create images of the corona from just above 
the photosphere out to ten or twenty solar radii. Instead, different in- 
struments have to be combined, as has been done, for example, with the 
LASCO coronograph on the SOHO spacecraft [161]. Examples and fur- 
ther details can be found at sohowww.nascom.nasa.gov/gallery/LASCO/ 
or lasco- WWW . nrl . navy . mil/lasco . html. 

The last component, the F-corona, or Fraunhofer corona, results from 
scattering by slow-moving dust particles. It extends into the interplanetary 
medium, where it is observed as zodiacal light. The F-corona is not part of 
the solar atmosphere. 



6.1.5 The Coronal Magnetic Field 

Figure 6.6 is a very simplified schematic, showing only the most important 
features, namely coronal holes and helmet streamers. The latter develop over 
active regions, the legs of the helmet streamer connecting regions of opposite 
magnetic field polarity. Electrons are captured inside these loops, thus the 
helmet streamer is a bright feature. The coronal holes on the other hand are 
regions with open field lines, allowing for a fast electron escape. Thus they 
appear as dark features, often with rays indicating the direction of the field. 

Over the poles of the Sun coronal holes are dominant. Occasionally, they 
can extend down to the solar equator or even into the opposite hemisphere. 




Fig. 6.6. Sketch of the solar corona. 
Two helmet streamers are shown to- 
gether with the coronal holes, the so- 
lar equator, and the nominal loca- 
tion of the source surface 
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Fig. 6.7. Photospheric magnetic-field mea- 
surements from National Solar Observatory 
(source: www . nso . noao . edu/ synoptic/) 



Streamer-like configurations are confined to the streamer belt, which is as- 
sociated with the active regions. Thus the streamer belt’s extension and its 
inclination relative to the solar equator varies over the solar cycle. The re- 
sulting magnetic field pattern with different magnetic field polarities on both 
sides of the streamer belt are carried out into interplanetary space by the 
solar wind and lead to a sector structure as described in Sect. 6.3.2. 

While the coronal magnetic field as described above is rather orderly, 
spectral line observations of the photosphere reveal a complex magnetic field 
pattern associated with sunspots and active regions. Figure 6.7 shows an 
example for the structure of the photospheric magnetic field. The bright 
and dark spots indicate regions of strong magnetic fields, the color gives the 
field’s polarity. Daily observations of the visible hemisphere of the Sun can 
be combined to give a map of the photospheric magnetic field such as shown 
at the top of Fig. 6.8. 

Excursion 6. Zeeman effect The basic tool used to determine the photo- 
spheric magnetic field is the Zeeman effect, that is, the splitting of a spectral 
line Ao in a magnetic field into a triplet of lines, with one member at wave- 
length Ao, also the 7r-component, and two members at Aq ± 5A, where 

5A = ^ ^ = 4.7 X 10-1^ \lgB , (6.4) 

g being the Lande g-factor. The split lines are linearly polarized: the tt- 
component is polarized parallel to the field, and the side lines, also called 
(Jv and ctr, are polarized perpendicular to it. Thus, when viewed parallel 
to the magnetic field (longitudinal field), only the av and ctr, are visible 
as circularly polarized lines (see Fig. 6.9). Viewed perpendicular to the field 
(transverse field), all three components are visible, as linearly polarized lines. 
Thus the Zeeman effect allows the meeisurement of both the magnetic field 
strength and its direction. 

It should be noted that the photospheric magnetic field is not the only 
factor influencing the spectral lines. Photospheric motions lead to shifts in 
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Fig. 6.8. Recon- 
structed photosphe- 
ric magnetic field 
for Carrington ro- 
tation 2004 (source: 
wso . St anf ord . edu/ 
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and calculated pho- 
tospheric field for 
the same rotation 
(source: quake . 

stanford.edu/~wso/ 
coronal.html) 



frequency owing to the Doppler effect. Thus a careful analysis of the Fraun- 
hofer lines provide a wealth of information about the photosphere. □ 

Coronal loops, structures such as filaments and prominences, and in situ 
observations in interplanetary space suggest that many of the small-scale 
photospheric structures form closed loops within less than two solar radii 
(see Fig. 6.16). Thus a solar source surface can be defined: the small-scale 
structures are closed below it and the resulting overall field pattern is carried 
outwards by the solar wind. The source surface is at a height of about 2.5 
solar radii and can be determined from the photospheric field pattern using 
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Fig. 6.9. Zeeman splitting when viewed into the fiel {lejfj and perpendicular to it 
(right) 
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Fig. 6.10. Magnetic field pattern on 
the source surface for solar-minimum 
{upper panel) and solar-maximum 
conditions {lower panel). The thick 
line is the neutral line separating the 
fields of opposite polarity; the thin 
lines are equipotential lines 



potential theory [225,457,571]. The constraints on the field are as follows: (a) 
the magnetic field at the source surface is directed radially, and (b) currents 
either vanish or are horizontal in the corona. The resulting map is shown at 
the bottom of Fig. 6.8. 

Figure 6.10 shows source surface maps for solar minimum and maximum 
conditions. A distinctive feature is the neutral line (thick line), separating the 
two magnetic field polarities. At the neutral line, the radial magnetic field 
vanishes. During solar minimum (upper panel) it is roughly aligned with the 
solar equator while with increasing solar activity (lower panel) the neutral line 
becomes wavy and extends to higher solar latitudes. During solar maxima, 
when the Sun changes polarity, the inclination of the neutral line is maximal. 
Since the neutral line separates magnetic fields of opposite polarity, a current 
must flow inside it: the neutral line is a current sheet. In Fig. 6.6 it would 
extend outwards through the tips of the helmet streamers: its extension into 
interplanetary space is called the heliospheric current sheet (HCS). 

The above description gives the impression of a rather static transition 
from the photospheric to the coronal magnetic held. Recent observation with 
the Transition Region And Coronal Explorer (TRACE), launched in 1998, 
revealed a highly filamented corona filled with flows and other dynamic pro- 
cesses [194]. Variability and motions are observed at all spatial locations in 
the atmosphere and on very short time scales. With the greatly improved 
spatial resolution of TRACE, a number of new properties in the corona have 
been identified, as follows, (a) Fine structures: the corona in active regions 
consists of numerous threads of emitting plasma, which are all clearly sep- 
arated from each other and in continuous motion. If these threads interact, 
reconnection can occur, (b) “Moss” is an intricate, dynamic fine structure 
near the base of active regions, (c) Dynamic structures that change their 
overall large-scale topology seem to indicate that new magnetic flux emerges 
in these regions, (d) Bundles of long, nearly linear structures emanating from 
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active regions in the vicinity of sunspots are related to steady outflows of 
hot material at roughly the local sound speed, (e) The cool, absorbing ma- 
terial embedded within the hot corona close to active regions is also in a 
highly dynamic state. Examples and the most recent results can be found at 
vestige . Imsal . com/TRACE/. 



6.2 The Solar Wind 

The corona does not show a well-deflned outer boundary but ragged struc- 
tures blending into the background. Thus how far does it extend? 

The Earth’s atmosphere is stationary, shaped by an equilibrium between 
incoming solar radiation and outgoing terrestrial radiation. On the Sun, the 
situation is different. Here the temperature is much higher and the solar 
atmosphere is not stable but blown away as solar wind. Ailing the entire 
heliosphere. The first direct measurements of the solar wind [204] started in 
1960. However, a particle flow from the Sun towards the Earth had already 
been suggested at the beginning of the twentieth century. To explain the 
relationship between aurorae and sunspots, in 1908 Birkeland [50] proposed 
a continuous particle flow out of these spots. Alternatively, Chapman [91] 
and Chapman and Ferraro [95] suggested the emission of clouds of ionized 
particles during flares only. Except for these plasma clouds, interplanetary 
space was assumed to be empty. Evidence to the contrary came from an 
entirely different source, i.e. the observation of comet tails. The tail of a 
comet neither follows the path of the comet nor is directed exactly radially 
away from the Sun. Instead, its direction deviates several degrees from the 
radial direction. Hoffmeister [226,227] suggested that solar particles and not 
the solar light pressure shape the comet tails. Biermann [48] noted that the 
fainter dust tails of the comets are indeed directed radially and most probably 
shaped by light pressure, especially since their spectra resemble the solar 
spectrum. To explain the shape of the main tail, he too invoked a continuous 
solar particle radiation. 

6.2.1 Properties 

The high variability of the solar wind in space and time reflects the underlying 
coronal structures. The most important features can be summarized as follows 
[183,473,500]: the solar wind is a continuous flow of charged particles. It is 
supersonic with a speed of about 400 km/s, which is 40 times the sound speed 
in the solar wind. A plasma parcel travels from the Sun to the Earth within 
roughly four days. The solar wind carries the solar magnetic held out into 
the heliosphere, the magnetic held strength amounting to some nanoteslas 
at the Earth’s orbit. The most recent observations are by WIND (see web. 
mit .edu/af s/athena/org/s/space/www/wind.html for details) and by the 
plasma instrument on board ACE (see www.srl.caltech.edu/ACE/ for the 
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instrumentation and general description and www.sel.noaa.gov/ace/ for 
the solar wind data). 

Two distinct types of plasma flow are observed - the fast and the slow 
wind, see, for example, [449,473,541]. The fast solar wind originates in the 
coronal holes, the dark parts of the corona dominated by open held lines. 
Fast solar wind streams are often stable over a long time period (some solar 
rotations) and variations from one stream to another are small. The fast 
solar wind has flow speeds between 400 km/s and 800 km/s, the average 
density is low, about 3 ions/cm^ at 1 AU. About 4% of the ions are helium. 
This ratio is very stable over different fast streams. The average particle flux 
is about 2 x 10^^ m“^ s“^, implying a total particle loss from the Sun of 
about 1.3 X 10^^ /s. The proton temperature is about 2 x 10^ K, the electron 
temperature is about 1 x 10^ K. 

The slow solar wind has lower speeds ranging between 250 km/s and 
400 km/s. Its density is about 8 ions/cm^ at 1 AU, and the flux density is 
about twice as large as that in the fast solar wind. During solar minimum 
the slow solar wind originates from regions close to the current sheet at the 
heliomagnetic equator. The relative amount of helium is highly variable, its 
average is about 2%. During solar maximum the slow solar wind originates 
above the active regions in the streamer belt, and its helium content is about 
4%. Compared with the fast solar wind, it is highly variable and turbulent, 
often containing large-scale structures such as magnetic clouds or shocks. 
The proton temperatures are markedly lower, about 3 x 10^ K, while the 
ion temperatures are similar. As in the fast wind, the temperature is always 
higher parallel to the magnetic held than perpendicular to it. On the average 
it is T|| ~ 2T_i_. 

Despite their differences, fast and slow solar wind streams also have simi- 
larities. For instance, the momentum flux M = npTripV^ on average is similar. 
The same is true for the total energy flux, despite the fact that its individual 
components, such as kinetic energy, potential energy, thermal energy, electron 
and proton heat fluxes, and wave energy flux, are different. 

Figure 6.11 shows hourly averages of solar wind parameters during solar 
minimum conditions (Carrington rotation 1896). From top to bottom, the 
panels give the angles between the solar wind and the Sun-Earth line in the 
north-south and east-west directions, the thermal speed of the solar wind, 
its density, and the bulk speed of the solar wind. Two fast solar wind streams 
start at DOY 143 and 150; a third, albeit slower stream starts late at DOY 
160. All three show up as steep increases in solar wind speed and thermal 
speed. The density before the arrival of the fast stream increases as solar 
wind is swept up, and it decreases abruptly as the spacecraft enters the fast 
stream. Prior to the arrival, the flow is slightly from the east, and with the 
arrival it turns abruptly to a flow slightly from the west. 

A special feature of the solar wind is coronal mass ejections (CMEs). Here 
the bulk speed is in the range 400 km/s to 2000 km/s and the composition is 
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Fig. 6.11. Solar wind under solar minimum conditions, WIND measure- 
ments (source: web .mit . edu/af s/ athena/org/s/space/www/wind/wind_f igures/ 
wind_95may20 . gif ) 



significantly different; in particular, up to 30% of the ions can be ct-particles, 
and even Fe^^'^ or He“^ can be observed occasionally. 

Is Slow Solar Wind Always the Same? Observations with the LASCO 
coronograph on SOHO suggest small-scale density inhomogeneities in the 
solar wind [477], originating from the tips of helmet streamers. These density 
variations are assumed to travel along the heliospheric current sheet where 
they had been detected in situ much earlier as localized maxima in proton 
density associated with the passage of a sector boundary [55]. Most likely 
these blobs form when a small plasmoid is disconnected from the helmet 
streamer by reconnection [280,546]. Thus, the slow solar wind seems to have 
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two distinct sources: plasmoids forming in the streamer belt and strongly 
overexpanding flux tubes at the boundaries of the coronal holes. 



6.2.2 Solar Wind Models 



The charge states of heavy ions indicate temperatures of about 10® K in 
the corona, independent of height. Thus hydrogen is completely ionized and 
the corona basically can be described as an electron-proton gas with small 
admixtures of heavier elements. In the lower corona, the electron density is 
about 10® to 10^ cm“® and it decreases with a scale height of about 0.1 Tq. 
One of the basic questions in understanding the corona and the solar wind is 
related to heating: because the photosphere only has a temperature of about 
5800 K, how can the corona be heated up to a million Kelvin? 



Chapman’s Hydrostatic Corona. One of the flrst models of the corona, 
introduced by Chapman in 1957 [92], avoided this question and simply de- 
scribed the corona as a static atmosphere, an equilibrium between the pres- 
sure gradient force and gravitation: 



dp 

dr 




(6.5) 



with G being the universal constant of gravitation and Mq the Sun’s mass. 

Owing to some unknown mechanism, heat is supplied continuously from 
the photosphere to the corona. Thus thermal energy has to be trans- 
ported outward through the corona by heat conduction with a heat flow 
Qh = — 47rr^XdT/dr. Because the electrons are the more mobile part in the 
electron-proton gas, the heat basically is transported by electron motion. 
From the electron density distribution one expects a weakly height-dependent 
heat conduction coeflicient %. For a completely ionized corona, the variation 
of density and temperature with height then is 
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Under coronal conditions, heat conduction is about a factor of 20 more effi- 
cient than in copper. Thus the temperature decreases only weakly with height 
(see (6.7)), with the consequence that “... the coronal gas surrounding the 
Earth may be expected to have a temperature of order of 100 000 K. This is 




6.2 The Solar Wind 153 



... consistent with my main inference - that the Earth is surrounded by very 
hot coronal gas, which greatly distends our outer atmosphere and that heat 
must flow from it by conduction into our atmosphere” [93], p. 477. 



Parker’s Hydrodynamic Corona. But the continuous particle flow in- 
ferred from the comet tails is in contrast to a static atmosphere extending 
far behind Earth’s orbit. Parker [393] argued that the high temperatures do 
not allow for a stationary corona and that heat is transported by particle 
streaming. Thus the solar atmosphere and the continuous particle radiation 
from the Sun both are the same. Parker used a hydrodynamic approach. Thus 
the hydrostatic equation has to be complemented by a term describing the 
fluid motion, leading to Bernoulli’s equation. However, Parker did not solve 
the heating and heat transport problems. 

In a simple approach, only protons are considered because they are the 
dominant ion species and carry virtually all of the mass of the solar wind. 
The momentum balance then is g{u • V)tx = — Vp — qMqG/t^ or, in the 
one-dimensional case for a spherically symmetric corona. 
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The factor 2 in the pressure term nk^T considers that both electrons and 
protons contribute a factor of tiUbT to the pressure. With the equation of 
continuity n(r) Ur(r) r^ = no ro, (6.9) can be written as 
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To describe the temperature gradient, Parker assumed an isothermal corona 
above about 1.4r© as suggested by the charge states of the heavier ions. 

Solutions of (6.10) are shown in Fig. 6.12. There are two curves repre- 
senting special solutions. These curves intersect at the critical point (uc,rc) 
with 
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Fig. 6.12. Topology of different so- 
lutions for the solar wind equation 
(6.10) 
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Fig. 6.13. Radial de- 
pendence of the so- 
lar wind speed for dif- 
ferent coronal tempera- 
tures, based on [395] 



The solution ‘A’ is the observed solar wind: it starts as a subsonic flow in the 
lower corona and accelerates with increasing radius. At the critical point 
the solar wind becomes supersonic. The solution than takes the form 
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with w = y/GM^Jr. For large distances, (6.12) can be approximated as 
Ur ~ 2uc y/\n{r/rQ). Figure 6.13 shows the radial dependence of the solar 
wind speed for different coronal temperatures. For a coronal density of about 
2 X 10^cm~^ and a temperature of one million Kelvin, the critical point is 
at about 6rQ. The solar wind than accelerates up to about 4Or0, afterwards 
propagating at a nearly constant speed of 500 km/s. The supersonic flow 
does not extend indefinitely, its density decreases during expansion. At a 
certain radial distance, most likely beyond 70 AU, the solar wind pressure 
will become too small to further support a supersonic fiow. Where the flow 
is slowed down to subsonic speed, a termination shock forms. The solar wind 
then continues as a subsonic fiow until the pressure of the interstellar gas 
becomes larger than the combined pressure of the solar wind and the frozen-in 
magnetic field. This is the heliopause, the boundary of the heliosphere which 
is expected beyond 100 AU. In front of the heliopause, a bow shock might 
develop where the interstellar gas is slowed down by the obstacle heliosphere. 

Solutions ‘C’ and ‘F’ of the solar wind equation also start as subsonic 
fiows in the lower corona. In solution ‘F’ the speed increases only weakly 
with height and the critical velocity is not acquired at the critical radius. 
The fiow continues to propagate radially outward, but then slows down and 
can be regarded as a solar breeze only. In ‘C’ the fiow has accelerated too 
fast and has become supersonic before reaching the critical height. It then 
turns around and fiows back towards the Sun as a supersonic fiow. 

The other curve going through the critical point, ‘B’, starts as a super- 
sonic flow in the lower corona and becomes subsonic at the critical point. This 
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flow would continue to propagate outwards at subsonic speed. If the flow had 
decelerated less, as in curve ‘D’, it still would be supersonic at the critical 
point, where it would accelerate again, leaving the Sun as a supersonic flow. 
Solution ‘E’ is entirely different. It starts as an inward flow blowing subsoni- 
cally towards the Sun from inflnity. The flow accelerates as it approaches the 
Sun, becoming supersonic at some distance larger than Uc. At that point the 
flow turns back and propagates outwards as a supersonic flow. 

Only one of the mathematically possible solutions of (6.10), curve ‘A’ 
in Fig. 6.12, is an approximation of the solar wind. But how good is this 
model? The assumption of an isotropic pressure p is valid near the Sun where 
isotropy can be maintained by collisions. As the plasma moves farther out, 
collisions become less frequent and the pressure parallel to the magnetic fleld 
is twice the perpendicular one. The temperature is assumed to be isotropic, 
too. Again, this might be true close to the Sun but not at the orbit of Earth. 
In addition, electron and proton temperatures are not the same, as is assumed 
in the model. These differences do not change the general character of the 
solution but modify the numbers. Another limitation is the consideration of 
only one particle species, namely protons. Since a-particles are four times 
heavier than protons, even the 2% to 4% of He in the solar wind contribute 
signiflcantly to the momentum transport. Thus an additional set of equations 
should be considered, leading to a reduction of the flow speed. 

A more severe limitation concerns the flelds. In the derivation of the 
hydrodynamic flow, no effects of the magnetic and electric flelds were con- 
sidered: the electromagnetic forces in the momentum balance were ignored. 
A more elaborate model should consider flelds, too. In such a magnetohydro- 
dynamic (MHD) model the critical point is lower in the corona, for average 
conditions at a height of about two solar radii, which is close to the height of 
the (flctitious) source surface. The general character of the solution nonethe- 
less is the same as in the hydrodynamic model, for a comparison of such a 
model with data see e.g. [532]. 

Parker’s hydrodynamic solar wind model nevertheless is a valid approxi- 
mation of the solar wind in quiet conditions. A comparison of model results 
with the solar wind observations, however, reveals a rather puzzling fact: the 
hydrodynamic model is more appropriate to describe the slow wind originat- 
ing in the streamer belt with its complex magnetic fleld structures than the 
fast solar wind blowing directly out of the coronal holes, a situation which is 
much closer to the assumptions inherent in the model. 

Overexpansion of the Solar Wind. Observations suggest that the so- 
lar wind and its sources might be even more complicated. In particular, it 
appears that the solar wind does not expand radially. An expansion factor 
can be deflned as the ratio between the cross-sections of a flux tube at the 
source surface and in interplanetary space. High-speed solar wind streams 
from coronal holes then are associated with small expansion factors while the 
low-latitude slow streams are associated with large expansion factors [545]. 
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These slow streams originate at the boundaries of coronal holes, their strong 
expansion then causes them to fill the space above low-latitude closed mag- 
netic field structures. Since the expansion factor can be inferred from the 
magnetic field strength and distribution on the source surface, magnetograms 
can be used to predict the solar wind speed at the orbit of the Earth. 

6.2.3 Coronal Heating and Solar Wind Acceleration 

Although the hydrodynamic description of the solar wind is a reasonable and 
valuable approach, one fundamental problem has been neglected: the heating 
of the corona. Basically, two lines of thought have evolved: heating by MHD 
waves and turbulence or, alternatively, small-scale impulsive energy releases 
due to reconnection, sometimes called nano-flares. A recent review about 
heating mechanisms is given in [141,530]. 

Waves and Turbulence. Of the basic MHD waves, the fast and slow 
magneto-acoustic waves are compressive, while the Alfven wave is a non- 
compressive propagation of fluctuations along the field. In a collisionless 
plasma, the Alfven wave propagates undamped, whereas the magneto-acoustic 
waves undergo Landau damping. Wave energy is then converted into thermal 
energy, mainly of the ion component in the plasma. Under solar conditions, 
the slow mode is damped very strongly, while the fast mode can propagate 
up to a distance of about 20rQ. On the basis of this, Barnes et al. [31] devel- 
oped a model of the solar wind with coronal heating by fast magneto-acoustic 
waves. While the Alfven wave is not damped, it nonetheless contributes to 
momentum transport and can be interpreted as a radiation pressure, accel- 
erating the plasma. Although non-thermal broadening of some spectral lines 
indicates the existence of waves or turbulence in the lower corona, it is not 
completely understood which waves these are, how they propagate outward, 
and whether the observations really are indicative of wave fields or, rather, 
of turbulence. A brief review can be found in [30,333]; recent developments 
are described in [526]. 

Excursion 7. Landau damping. Landau damping is a characteristic feature 
of collisionless plasmas: waves are damped without energy dissipation by 
collisions. In Landau damping, a propagating wave accelerates gas particles 
contained in a distribution function that happen to have a similar direc- 
tion and speed to the wave: Landau damping is therefore a resonance effect 
or resonant damping. Landau damping is therefore an example of resonant 
wave-particle interaction. Chen [97] compares this process to a surfer riding 
an ocean wave: when surfing, a surfer launches him/herself in the propa- 
gation direction into a steepening part of an incoming wave and is further 
accelerated by this wave. Because a distribution function normally contains 
many more slower than faster particles, the wave loses energy by accelerating 
the slower particles. Thus the original distribution function (dashed line in 




6.2 The Solar Wind 157 




Fig. 6.14. Landau damping: around the phase speed 
the wave modifies the original particle distribution 
(dashed) by accelerating particles 



Fig. 6.14) is deformed, with particles being removed from lower to higher 
speeds around the phase speed of the wave: the distribution function flattens 
near the phase velocity, where particles are in resonance with the wave. In- 
teraction between the particles might lead to a redistribution of this energy 
gain, tending to reestablish the original distribution. 



Impulsive Energy Release: Reconnection So far, the corona has been 
treated only hydrodynamically and the magnetic held has been ignored. Even 
for coronal heating by MHD waves, the held has been considered as only a 
carrier for the waves, while its energy content has been neglected. The con- 
version of held energy into thermal energy has therefore not been considered 
as a heating mechanism, although this concept is widely applied in space 
physics on larger scales: models for the acceleration of particles in the magne- 
tosphere’s tail or solar flares often involve reconnection because the magnetic 
held is the only source of energy available. 

As we saw in Sect. 3.5, reconnection requires flelds of opposite polarity. 
The photosphere, as the top of the convection zone, is in continuous mo- 
tion with bubbles rising and falling and plasma flowing in and out, as can 
be inferred from the Doppler shift of spectral lines and even seen directly 
in the TRACE data [194]. The plasma motion also shuffles the magnetic 
held around: magnetic threads emerge in the intergranular lanes between the 
granulation cells. While the latter are associated with an upwelling flow, a 
downflow of plasma is observed in the former. A magnetic flux tube therefore 
sits in the center of a tornado of downflowing gas. Thus, on a small scale, mag- 
netic held configurations suitable for reconnection will form frequently [214], 
eventually converting held energy into thermal energy [422]. Observational 
evidence for such small-scale impulsive energy releases is found in electro- 
magnetic radiation, in particular in so-called bright X-ray points [195] and 
small-scale exploding EUV events. The recent TRACE observations reveal 
that X-ray bright points are not really points, but can be resolved into highly 
dynamic loops with distinct features in their footpoints [194]. These observa- 
tions lend additional support to magnetic reconnection as a mechanism for 
coronal heating. 

But heating by microflares still provides grounds for debate. One prob- 
lem is related to the energy provided by these flares: it might not be enough 
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to provide the required heating [12], although more recent studies using 
SOHO/EIT [44] data and TRACE [398] data tend to find larger energy re- 
leases. It is also suggested that, since reconnection can generate Alfven waves, 
the two models might be connected [479]. 



6.3 The Interplanetary Magnetic Field (IMF) 

The photospheric magnetic field was discovered by Hale in 1902. The splitting 
of spectral lines due to the Zeeman effect suggests a photospheric field in 
the order of 10“^ T or 1 G (G: gauss) outside and 3000 G to 4000 G inside 
sunspots. Within less than 2 solar radii, this complex and highly variable field 
is reduced to a rather simple, radially directed one. Since the conductivity 
of the solar wind is high, the magnetic field is frozen into it and carried 
out into interplanetary space. The Sun’s rotation winds up these field lines 
to Archimedian spirals. Thus with increasing radial distance, the originally 
radial magnetic field becomes more and more toroidal. 

6.3.1 Spiral Structure 

The Sun rotates with a sidereal rotation period of 27 days. The solar wind 
fiows radially away from the Sun, carrying the frozen-in magnetic field. While 
the solar wind propagates outward, the base of the field line frozen into the 
plasma parcel is carried westward, forming an Archimedian spiral,^ as shown 
in Fig. 6.15. A similar effect can be observed with a rotating sprinkler; thus 
the deformation of the field lines also is called the garden-hose effect. 

The equation of the Archimedian spiral can be derived from the displace- 
ments Ar and A(^. Initial conditions of the plasma parcel on the Sun are a 
source longitude (po and a source radius tq. At a time t the parcel then can 




Fig. 6.15. Deformation of a magnetic field line due to the combination of a radial 
plasma flow and the Sun’s rotation. The numbers indicate consecutive times after 
a plasma parcel has left the Sun at time to 

^ An Archimedian spiral is defined as a curve resulting from a motion u of a point 
along an axis that rotates with constant angular speed u around the origin: 
r = a(f with <p = v/u. 
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be found at the position (f{t) = and r{t) = u^owii + ^o- Eliminating 

the time yields the equation for the Archimedian spiral: 



T — Ugowi “1“ ^0 • (6.13) 

LOq 

With tan-0 == ujQr/usowii the path length s along the spiral is given as 

5 = ('^ + 1 + In -h (6.14) 

The magnetic field in the equatorial plane can be expressed in polar coor- 
dinates B = {Bj:, B^). The magnitude of B depends on radial distance only: 
|.B| = B{r). Gauss’s law in spherical coordinates (see A. 3. 3) yields 

V-B=\-^{r^Br) = 0 (6.15) 

or 

or r^Bj: = ToBro- Thus the magnetic fiux through spherical shells is conserved 
and the radial component of the field decreases as 

B, = Bo(^)". (6.16) 

Since the magnetic field is constant, it is dB/dt = 0. Prom the frozen-in 
condition (3.84) we then get V x (tt x B) = 0, or in spherical coordinates 

1 O 

- (r{u^B,-u,B^))=0. (6.17) 

r or 

Thus we have r{u^Bj: — u^B^) = const. Let us assume ro to be at the source 
surface. There B is radial and we get 

r'UnpB-^ — rQU^p^Bo r^uj^Bo . (6.18) 



In the last step, the angular speed of the Sun is used to describe the azimuthal 
component of the solar wind speed at the source surface. From (6.18) the 
azimuthal component of the magnetic field is 
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(6.19) 



For large distances, rw© > u^, (6.19) is approximately B^ = -rujQBr/ur. 
The azimuthal component therefore decreases with 1 /r while the radial com- 
ponent decreases as 1/r^. The field strength decreases with r as 
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(6.20) 
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The angle ijj between the magnetic field direction and the radius vector from 
the Sun is tan 7 /; = For large distances this reduces to tan^ — uo^r/ur. 

At the Earth’s orbit, tan-?/^ is about 1 for typical solar wind conditions, and 
thus the field line is inclined by 45° with respect to the radial direction. 

The current in the heliospheric current sheet is related to the magnetic 
field by Ampere’s law (2.7). In spherical coordinates the current density in 
the plane of the ecliptic then is 

. ro . . . B^ro 

Jr = Jo • — and = Jo^— • (6-21) 



6.3.2 Sector Structure 

So far we have considered only the shape of the interplanetary magnetic field 
lines but not their direction. The first long-term observations of the IMF by 
IMP 1 in 1963 over a couple of solar rotations revealed a sector pattern as 
shown in Fig. 6.16: the magnetic field polarity is uniform over large angular 
regions and then abruptly changes polarity. These magnetic field sectors are 
stable over many solar rotations. At most times either two or four sectors can 
be observed: if the neutral line is tilted without any wiggles as indicated in 
the upper panel of Fig. 6.10, a pattern of two magnetic field sectors arises; 
a wavy neutral line leads to four or more sectors. During solar maximum 




Fig. 6.16. Relationship between the photospheric, source surface and interplan- 
etary magnetic field. Dashed and solid lines indicate negative and positive mag- 
netic field polarities, respectively. Reprinted from K.-H. Schatten et al. [457], Solar 
Physics 6 , Copyright 1969, with kind permission from Kluwer Academic Publishers 




6.3 The Interplanetary Magnetic Field (IMF) 161 

the sector structure is complex and distorted by a large number of transient 
disturbances. 

6.3.3 The Ballerina Model 

The sector boundaries are the extension of the neutral line into the interplan- 
etary medium, the so-called heliospheric current sheet (HCS). Figure 6.17 
shows the HCS extending far into the interplanetary medium. The inclina- 
tion of the neutral line defines the width of a cone inside which an observer in 
space alternately sees different polarities of the coronal/interplanetary mag- 
netic field. The maximum inclination of the neutral line at each time is called 
the tilt angle. It can be used as an alternative measure of solar activity and 
is an indicator of the range over which both field polarities can be observed. 

Figure 6.18 shows a three-dimensional sketch of the wavy current sheet 
with some field lines for solar minimum conditions. During solar maximum a 
more complex and more wavy structure would be observed and the neutral 
line would be bent towards high latitudes, as also suggested in the cross- 
section shown in Fig. 6.17. Figure 6.18 anticipates the overexpansion of the 
solar wind: field lines from the borders of the coronal holes expand such that 
they overlay the closed magnetic field regions on the Sun, even extending 
down to the heliospheric current sheet. 



Fig. 6.17. Heliospheric current sheet and 
definition of the tilt angle a as inclination 
of the neutral line in the tilted dipole model 
proposed in [413]. The width of the cone is 
twice the tilt angle a 



Fig. 6.18. Current sheet in 
the inner heliosphere in the 
Ballerina model. The thick 
lines indicate the magnetic 
field lines. Based on [491] 
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6.3.4 Corotating Interaction Regions 

Fast and slow solar wind streams originate on the Sun. While these streams 
propagate outward, the frozen-in magnetic field is wound up to Archimedian 
spirals. In a slow stream, the field line is curved more strongly than in a 
fast one. Since field lines are not allowed to intersect, at a certain distance 
from the Sun an interaction region develops between the two streams. It was 
soon realizes that “the collision of these plasmas will lead to the formations 
of two shock waves and a tangential velocity discontinuity between them” 
[132]. Because this structure rotates with the Sun, it is called a corotating 
interaction region (CIR). Often the source locations of the fast and slow solar 
winds are rather stable and an observer in space sees the CIR again during 
the following solar rotations. In this case, it is called a recurrent corotating 
interaction region. 

Figure 6.19 shows an idealized sketch of the evolution of a CIR in the 
inner heliosphere. On the Sun, there is an abrupt change in solar wind speeds 
from fast to slow. As these streams propagate outward, fiow compression and 
defiection on both sides of the interface tend to smoothen the jump, leading 
to a continuous increase in plasma speed. The region of compressed plasma 
at the transition between the fast and slow stream at 1 AU typically extends 
over about 30° while the plasma might originate from a coronal region as 




Fig. 6.19. Idealized 
view of a corotat- 
ing interaction region 
(CIR) in the inner 
heliosphere. Reprinted 
from R. Schwenn [473] , 
in Physics of the in- 
ner heliosphere, vol. I 
(eds. R. Schwenn and 
E. Marsch), Copyright 
1990, Springer- Verlag 
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wide as 90° or more. Thus magnetic field sector boundaries often found close 
to the compression region are not necessarily related to the interface but 
might originate in a coronal region far from the boundary between the fast 
and slow streams. This is also evident from observations of the corona and 
the photosphere: the boundaries of the coronal holes are not related to the 
neutral line of the photospheric field. In particular, the polar coronal hole 
can extend into the opposite hemisphere, crossing the solar equator as well 
as the current sheet. 

With increasing distance from the Sun, the characteristic propagation 
speeds, which are the sound and Alfven speeds, decrease. At some distance 
between 2 and 3 AU, the density gradient on both sides of the compression 
region becomes too large and a shock pair develops, propagating away from 
the interface [247,490]. The shock propagating into the slow wind is called the 
forward shock, the one propagating into the fast wind is the reverse shock. 

Corotating interaction regions tend to distort or even destroy all small- 
scale fiuctuations and disturbances propagating outward from the Sun. In the 
outer heliosphere, the magnetic field and therefore also the shock fronts are 
more azimuthally aligned, sometimes extending around the entire Sun. Thus 
finally the spoke-like structure of different solar wind streams close to the 
Sun is converted into a shell of concentric shock waves propagating outward 
like waves from a stone thrown into water. When CIRs or CIRs and travelling 
interplanetary shocks interact, merged interaction regions result which play 
a crucial role in the modulation of the galactic cosmic radiation. A summary 
of the plasma physical properties of CIRs and their consequences for different 
particle populations in the three-dimensional heliosphere is given in [27]. 

6.3.5 The Heliosphere During the Solar Cycle 

Sunspots strongly modify the photospheric magnetic field, subsequently 
changing the field on the source surface and the tilt angle of the neutral 
line. These modifications are transported outwards even to the borders of 
the heliopause and manifest themselves in spatial and temporal variations in 
solar wind and magnetic field parameters. 

The most dramatic variation of the heliospheric structure during the solar 
cycle is related to the neutral line of the coronal magnetic field and its in- 
terplanetary continuation as the heliospheric current sheet. The waviness of 
the current sheet, as described by the tilt angle, increases towards the solar 
maximum. Thus the current sheet is rather flat during the solar minimum, 
as shown in the left panel in Fig. 6.20, while it extends to much higher lat- 
itudes during solar maximum. During solar minimum the CIRs are confined 
to the vicinity of the equatorial plane while during solar maximum conditions 
stream interactions also can be observed at higher latitudes. 

In the solar wind parameters, the solar cycle variations are less pro- 
nounced [183,473]. In general, any long-term variations apparent in the data 
are small compared with short-term variations. In addition, each solar cycle 
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Fig. 6.20. Waviness of the heliospheric current sheet during solar minimum and 
maximum conditions, based on a sketch by R. Jokipii, University of Arizona 



seems to be slightly different, and thus a parameter might be related to solar 
activity quite well in one cycle but not in another. Despite the uncertain- 
ties involved, there are some correlations which can be understood easily; for 
instance: (a) The average solar wind speed is higher during solar minimum 
than during solar maximum because high-speed solar wind streams are ob- 
served more frequently and for longer times during solar minimum, (b) The 
average solar wind densities are roughly constant except for individual time 
periods when exclusively slow solar wind streams were observed, (c) The mo- 
mentum flux is modulated by ±28% with a well-defined minimum at solar 
maximum, which just is a combination of (a) and (b). For the same rea- 
son, the kinetic energy flux is modulated by ±40%, again with its minimum 
around solar maximum, (d) The relative amount of helium has a minimum 
of 2.8% around solar minimum and a maximum of about 4% around solar 
maximum. Again, this reflects the fact that during solar minimum conditions 
the fast solar wind is observed more frequently. 



6.4 Plasma Waves in Interplanetary Space 

The interplanetary magnetic field is highly variable on different temporal and 
spatial scales. For instance, fast and slow solar wind streams form interaction 
regions. Coronal mass ejections, sometimes driving an interplanetary shock, 
are transient disturbances, and on a smaller scale, waves and turbulence are 
superposed on the average field. 

Figure 6.21 illustrates the variability of the magnetic held on time scales 
of minutes to hours. It shows the magnetic held azimuth (angle between 
magnetic field line and radial direction), the elevation (inclination of the held 
with respect to the plane of ecliptic), and the flux density for a time period 
of 7 hours during unusually quiet (left) and turbulent (right) interplanetary 
conditions. Magnetic held fluctuations are more pronounced in direction than 
in flux density and are quite irregular in amplitude and frequency. 
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Time T/H 

Fig. 6.21. Fluctuations in the magnetic field azimuth, the elevation, and the flux 
density for extremely quiet conditions {left) and during a turbulent phase (right). 
The different azimuth angles indicate an outward direction of the field in the left 
panel and an inward direction in the right panel. Data from the Helios magnetome- 
ter, University of Braunschweig 



6.4.1 Power-Density Spectrum 

The magnetic field fluctuations can be described by a power-density spectrum 
[35,40]: 

/(fc||) = C•fc-^ (6.22) 

Here is the wave number parallel to the field, q the slope, and C a constant 
describing the level of the turbulence. 

The power-density spectrum in Fig. 6.22 shows magnetic field fluctuations 
on different scales. Its slope is distinct in different parts of the spectrum, 
indicating different sources and modes of turbulence. Basically, four regimes 
can be distinguished: 

• Large-scale structures lasting a few days up to a solar rotation are related 
to the stream structure of the solar wind and to solar wind expansion. Both 
processes are the sources of turbulence on smaller scales; the frequencies 
of the large-scale turbulence are below 5 x 10"^ Hz. 

• Meso-scale fluctuations are associated with the flux-tube structure of the 
interplanetary medium which originates in the photospheric supergranula- 
tion. Frequencies range between 5 x 10“^ and about 10“'^ Hz. 

• In the inertial range, mainly Alfven waves with periods between some 
20 min and more than 15 h are found, corresponding to frequencies be- 
tween 10“^ and 1 Hz. The slope q varies between —1.5 and —1.9. Magnetic 
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Fig. 6.22. Magnetic field power- 
density spectrum. Reprinted from 
K.U. Denskat et al. [129], J. Geophys. 
Res. 87, Copyright 1983, American 
Geophysical Union 



field fluctuations in the inertial range seem to be responsible for the scat- 
tering of protons in interplanetary space (Sect. 7.2). 

• The smallest scales are in the dissipation range above 1 Hz. Here the spec- 
trum is steeper with a slope close to —3. The observed fluctuations can be 
attributed to ion cyclotron waves, ion acoustic waves, and Whistlers. 



6.4.2 Waves or Turbulence? 

So far we have described the fluctuations in terms of waves. But a single 
observer in interplanetary space cannot decide whether the fluctuations car- 
ried across him by the solar wind are waves or turbulence because he is 
not able to distinguish between spatial and temporal variations. Thus the 
question of whether the magnetic held fluctuations should be interpreted in 
terms of waves or turbulence has led to a long and sometimes fruitless con- 
troversy [332,525]. Only the modern concepts of MHD turbulence, e.g. [365], 
allowed a kind of uniflcation of both approaches: dynamical MHD turbu- 
lence is not the simple superposition of different waves, but rather consists 
of wave-packets which can interact with each other or can decay and excite 
new waves. 

We will not go into the details of this debate, but only introduce the 
concept of Alfvenic turbulence or Alfvenicity because it offers a helpful tool 
in the description of magnetic held turbulence. Alfven waves are transverse 
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Fig. 6.23. Hourly averages of 
magnetic field and plasma speed 
components. The good correla- 
tion between the plasma and 
field fiuctuations characteristic of 
Alfvenic fiuctuations is evident. 
Reprinted from R. Bruno et al. 
[64], J. Geophys. Res. 90, Copy- 
right 1985, American Geophysi- 
cal Union 



waves propagating along the magnetic field line with the Alfven speed va = 
Bo/y/ixoQ (see Sect. 4.2.1). Fluctuations are Alfvenic if the fluctuations 5ttsowi 
in flow speed and 6B in flux density obey the relation 

^'*^sowi = ^ r— • (6.23) 

In the plasma and held data shown in Fig. 6.23, the good correlation 
between these fluctuations is evident. Fluctuations are classifled as Alfvenic 
if the correlation coefficient is larger than 0.6. Obviously, this is true for 
Alfven waves. But there is also a large number of other fluctuations which 
fulflll (6.23). In particular, structures with variable \B\ can also fulfill (6.23), 
as can many of the static structures in the solar wind. Alfven waves contribute 
only a small amount to the Alfvenic fluctuations. 

The Alfvenicity of fluctuations is useful in the description of the evolution 
of turbulence from an orderly state (high Alfvenicity) to an entirely stochas- 
tic one [525]. For instance, the Alfvenicity is larger close to the Sun than at 
the Earth’s orbit, indicating that in the inner heliosphere most of the fluc- 
tuations are of coronal origin. As these fluctuations decay, the Alfvenicity 
decreases and the slope of the power-density spectrum evolves towards —5/3, 
which is the Kolmogoroff spectrum of random, uncorrelated turbulence. The 
Alfvenicity is larger in fast solar wind streams than in slower ones; thus in 
the fast wind an orderly state is preserved over larger spatial scales. If fast 
and slow streams interact, the Alfvenicity decreases and the spectrum takes 
the slope of the Kolmogoroff spectrum. The Alfvenicity can also be different 
on both sides of the heliospheric current sheet. 
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The solar wind and its magnetic field therefore have to be understood as a 
dynamically evolving, inhomogeneous, anisotropic, turbulent magneto-fiuid. 
With increasing distance, the fiuctuations embedded in this fiuid evolve from 
Alfvenic turbulence close to the Sun towards a Kolmogoroff spectrum. 



6.5 The Three-Dimensional Heliosphere 

Until the early 1990s our knowledge of the heliosphere had been limited to the 
plane of ecliptic. The main goal of the Ulysses mission, launched in October 
1990, is the study of the heliosphere’s third dimension, i.e. plasmas, particles, 
and fields in the polar regions of the Sun. 

The Ulysses trajectory for the prime mission is shown in Fig. 6.24. A 
swing-by at Jupiter allowed Ulysses to escape out of the plane of ecliptic into 
an elliptical orbit around the Sun. This orbit is inclined by 80° relative to 
the solar equator; the orbital period is 6.3 years. Ulysses fiew below the Sun’s 
south pole in autumn 1994 and above her north pole in summer 1995, both 
polar passes were made during solar minimum. The mission will continue for 
another two orbits, allowing for observations over the poles at solar maximum 
and during the following minimum. The most important results of the first 
polar pass are summarized in a series of papers in Science 268 . 

Plasma and field observations offered some surprises for the scientists. 
For instance, the radial component of the magnetic field, which is most eas- 
ily related to the global solar magnetic field, failed to show any latitudi- 
nal gradient [26], although the photospheric magnetic field clearly reveals a 
dipole-like pattern. Thus magnetic flux is removed from the poles towards 
the equatorial regions, as had been suggested in the sketch in Fig. 6.18 and 
is also suggested in the (empirical) concept of overexpansion. This observa- 
tion has consequences for solar wind acceleration models, where the resulting 
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Fig. 6.24. Flight path of Ulysses. After a swing-by at Jupiter in February 1992, 
the spacecraft left the plane of ecliptic 
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stress of the magnetic field thus far has not been considered (except for the 
newer works which also consider the overexpansion), and for models describ- 
ing the modulation of galactic cosmic rays. In addition, the magnetic fiux in 
the southern and northern hemisphere is different with an increase in radial 
magnetic field strength of about 30% in the southern hemisphere [160,493]. 
This suggests an offset of the heliomagnetic equator by about T to the south: 
the solar magnetic field therefore is not symmetrical about the heliographic 
equator. 

The plasma measurements showed a pronounced latitudinal variation of 
the solar wind speed. As shown in Fig. 6.25, it increases from about 450 km/s 
in the equatorial plane to about 750 km/s above the poles. Up to a latitude 
of about 30° S, which corresponds to the tilt angle at that time, there is 
a strong variation between fast and slow streams with a period of about 
26 days, resulting in a recurrent CIR. At latitudes higher then 50°S, only 
the fast solar wind streaming out of the coronal hole is observed [409], in 
agreement with our expectations. The composition of the solar wind, on the 
other hand, offered some surprises and also a big challenge for theory: the 
compositions of the fast and slow streams were markedly different, but the 
abundances were not what would be expected for ions accelerated in the hot 
corona. Instead, they were more representative of ions formed in the lower 
temperatures of the chromosphere [184]. 




Fig. 6.25. Polar plot of solar wind speed as a function of heliolongitude for the 
out-of ecliptic phase of the Ulysses mission from February 1992 through January 

1997. Reprinted from D.J. McComas et al. [342], J. Geophys. Res. 103, Copyright 

1998, American Geophysical Union 
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A comparison between plasma and magnetic field fluctuations inside the 
coronal hole revealed the existence of large-amplitude, long-period Alfven 
waves propagating outward from the Sun [26] . Thus it appears that the fluc- 
tuations, which originate close to the Sun in the acceleration region of the 
solar wind, are less likely to decay in the uniform fast solar wind flow than 
they are in the complex and interacting flows in the plane of ecliptic. 



6.6 The Active Sun 



The variability of the Sun is most obvious in the number and spatial distribu- 
tion of sunspots. But other properties, such as the electromagnetic radiation, 
the solar wind, and the solar and interplanetary magnetic fields, change too. 
And, of course, during times of high solar activity there are the phenomena 
of violent releases of energy and matter, i.e. flares and coronal mass ejections. 

We have already considered a simple model of a sunspot in example 11. 
Figure 6.26 shows a white light image of a sunspot and its surroundings. The 
dark spot, also called the umbra, is surrounded by a penumbra which consists 
of radially oriented filaments. Outside the penumbra, the photospheric gran- 
ulation cells are visible. The darker the umbra, the lower the temperature 
and the greater the magnetic field strength. By use of the Stefan-Boltzmann 
law, the temperature Tgpot inside the spot can be calculated from the ratio 
of the intensity /spot of the electromagnetic radiation inside the spot to that 
for the photosphere /photo- 



/spot 

/photo 



/ ^pot 

V ^photo 



4 



(6.24) 



This temperature difference can be used to calculate the difference in gas- 
dynamic pressure and thus also the magnetic field strength, as demonstrated 
in example 11. 



6.6.1 The Solar Cycle 

The first records of sunspot, the prime indicators for solar activity, date back 
to the fourth century BC when the Greek astronomers noted dark spots on the 




Fig. 6.26. Sunspot with surrounding granules, 
white light image. Source: www.uni-sw.gwdg. 
de/ resear ch/exp_solar/eflecken . html 
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solar surface. Ancient Chinese astronomers knew about sunspots, as did the 
pre- Spanish Peruvians. In Europe, isolated records can be traced back to the 
ninth century, but systematic observations started only with the development 
of the telescope early in the seventeenth century. The variability of shape, 
location, and number of sunspots was recognized early; however, owing to 
the Maunder minimum, a period of very low solar activity in the seventeenth 
century, the solar cycle was recognized only in 1843 by H. Schwabe. 

Figure 6.27 shows two different representations of the solar cycle. In the 
top panel, a butterfly diagram gives the latitudinal distribution of sunspots, 
and in the lower panel, the sunspot number is shown. Alternatively, a sunspot 
relative number or Wolf number can be used which considers the sunspot 
size and its relation to other spots or an active region: R = k (10^ -h /) 
with g being the number of sunspot groups, / the number of single spots, 
and k a normalization factor to standardize observations (e.g. corrections for 
visibility). At the solar minimum, the Sun is almost spotless. Then spots 
start to appear at latitudes around 30° . These spots are relatively stable and 
often can be observed over some solar rotations. They move towards the solar 
equator while at higher latitudes new spots emerge. The number of sunspots 
increases until solar maximum. Afterwards, only a few new sunspots appear 
on the disk while the sunspots at low latitudes dissolve. The total number 
of sunspots decreases. Just after the solar minimum new sunspots begin to 
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Fig. 6.27. Two different representations of the solar cycle. (Top) Butterfly 
diagram of the latitudinal distribution of sunspots, with each bar marking a 
sunspot. (Bottom) Sunspot number versus time for the same time period (source: 
science . mf sc . nasa . gov/ssl/pad/ solar/images/bf ly . gif ) 
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emerge at higher latitudes. The average duration of such a cycle is 11 years 
with variations between 7 and 15 years. 

Prom the lower panel in Fig. 6.27, it is evident that solar activity is highly 
variable between different solar cycles. For instance, the sunspot number 
in the 1958 solar maximum was about twice as large as that in the 1855 
maximum. The highest number of sunspots for a month observed so far was 
254 in October 1957. Over longer periods, variations by up to a factor of 
4 have been observed; at some times, e.g. during the Maunder Minimum of 
1650-1710, solar activity and sunspot numbers can be even smaller. 

The magnetic cycle is twice as long: within the 11 years of a sunspot 
cycle, the solar field reverses its polarity once. This is evident in Fig. 6.28, 
where the magnetic flux is plotted in a butterfly-diagramm. Thus only after 
22 years the original polarity pattern is restored. This 22-year cycle is called 
the Hale cycle or the solar magnetic cycle. 




N73 19(2 1985 19BS WW Nfl? 



Year 

Fig. 6.28. Butterfly diagram of net magnetic flux (constructed from the NSO/KP 
synoptic rotation magnetic maps) from April 1975 to August 1997. Note the 
dominant opposite polarities of the magnetic flux poleward and equatorward in 
the butterfly pattern, reversed between the two hemispheres and between cycles 
21, 22, and 23. Large-scale patterns of monopolar magnetic flux extend pole- 
ward from the activity belts in several “streams”. Those with the polarity of the 
active-region follower fields ultimately result in the reversal of the polar fields. 
The change in the tilt of the Sun’s axis introduces an annual short-term varia- 
tion in the polar fields. (These NSO/Kitt Peak data were produced cooperatively 
by NSF/NOAO, NASA/GSFC, and NOAA/SEC.) Source of figure and caption: 
www.hao.ucar.edu/siiii/SMI_platel.html; see [324] 



6.6 The Active Sun 



173 



6.6.2 A Simple Model of the Solar Cycle 

Evidently, the clue to understanding the solar cycle lies in the magnetic field 
and its reversal. Helpful information, in particular for a proof of models, can 
be found in the details of sunspots: 

1. Sunspots emerge at relatively high latitudes and move towards the equa- 
tor (Sporer’s law). During the solar cycle the latitude of emergence also 
moves towards the equator. 

2. Sunspots are observed in bipolar groups with the leading spot (in the 
direction of apparent motion and closest to the equator) having the same 
polarity as the hemisphere it appeared in while the following spot has 
the opposite polarity (Hale’s polarity law). The bipolar groups in oppo- 
site hemispheres have opposite magnetic orientation and this orientation 
reverses in each new solar cycle. 

3. The tilt angle of the active regions is proportional to the latitude (Joy’s 
law), which is actually a small deviation from Hale’s law. 

The motion of the sunspots reveals another important property of the Sun, 
namely its differential rotation: the Sun rotates faster at its equator and 
slower at the poles with a sidereal rotation time of 26.8° /day at the equator 
and 31.8° at 75° latitude (see Table 6.1). 

The Solar Dynamo — Basic Idea. The source of solar activity is a MHD 
dynamo, as first proposed by Babcock [17]. While the details of this process 
are not completely understood, the underlying principle seems to be valid. 
The dynamo process works within or at the bottom of the convection zone, 
most probably at the tachocline where most of the shear is concentrated. 
During the solar minimum the Sun’s magnetic field is poloidal. Differential 
rotation winds it up to a toroidal field, as already described (Sect. 3.6). The 
magnetic field is then concentrated in flux tubes with radii of a few hundred 
kilometers and magnetic field strengths between a few hundred and about 
2000 G. The bulk motion in the convection zone twists the field lines, locally 
increasing the magnetic field strength to up to some thousands of gauss. This 
high flux density leads to magnetic buoyancy driving up magnetic flux ropes 
through the photosphere (see Fig. 6.29). 

The increased magnetic flux inhibits convection; thus less heat is trans- 
ported towards the photosphere and the regions of high magnetic flux are 
cooler. Where the flux tubes intersect the photosphere, bipolar sunspots 
emerge with the polarity pattern required by Hale’s law. The latitude of 
the first appearance of sunspots is determined by the interplay of differential 
rotation and magnetic field strength. With the emergence of sunspots the 
magnetic pressure locally is reduced and the process continues at lower lati- 
tudes, leading to the motion of sunspots towards the equator during the solar 
cycle. Meridional flows in the convection zone combined with magnetic field 
diffusion and dispersion drive the leading spot towards the equator while the 
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Fig. 6.29. Increased magnetic buoyancy drives 
flux tubes through the photosphere, creating 
two sunspots of opposite polarity. Reprinted 
from K. Lang [308], Sun, earth, and sky, Copy- 
right 1995, Springer- Verlag 



following spot stays behind. The leading spots of the opposite hemispheres 
converge at the equator and dissolve by reconnection. The following spots 
undergo reconnection with the polar fields. Since the polar field is slightly 
smaller than the field accumulated in the following spots, the polarity of 
these spots eventually takes over and the magnetic field is reversed. The pos- 
sibility of this process is supported by the lanes of different than the prevalent 
polarity of the hemisphere at higher latitudes, which can be seen in Fig. 6.28. 

This last step, i.e. the pole reversal, is understood least. An alternative 
explanation for the polarity reversal is the a-effect discussed in Sect. 3.6 that 
creates a toroidal current which in turn gives rise to a poloidal magnetic field 
of opposite polarity. 

The Solar Dynamo — Details. Two important details in the solar cycle 
discussed above are the emerging of magnetic flux in an active region and 
the treatment of the a-effect in different dynamo models. 

The rise of a fiux tube from the tachocline through the photosphere into 
the chromosphere or even the corona is infiuenced predominately by an inter- 
play between magnetic buoyancy, aerodynamic drag, and the Coriolis force. 
For an isolated horizontal fiux, the pressure balance is 

Pi = Pe , (6.25) 

Zlio 

Pi and pe are the internal and external gas pressures and B^/2 /xq is the 
magnetic pressure. Equation (6.25) implies pi < p^. Expressing the pressure 
by the gas law, we can rewrite (6.25) and obtain 

QiRT + - — = QeRT (6.26) 

ZflQ 

or, after rearrangement, 

Qe B 

Qe 2/ioi^e 



(6.27) 
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As a consequence, the fluid in the flux tube has a lower density than its 
surroundings and thus the flux tube must be buoyant. This effect is termed 
magnetic buoyancy. The combination of magnetic buoyancy and aerodynamic 
drag then determines the details of the rise and, later, the emergence of the 
flux tube. The Coriolis force does not influence the lifting of the tube but 
modifles its shape: it twists the rising flux tube from a plane loop into a 
three-dimensional structure where the area enclosed by the loop has an S- 
shape in the vertical direction. The inclusion of the Coriolis force allows us 
to understand Hale’s and Joy’s laws. 

As mentioned in Sect. 3.6, differential rotation and magnetic buoyancy 
lead to the poloidal held and emerging flux in active regions, while the held 
reversal requires an additional process to work, the a-effect. Models using 
the mean-fleld theory differ in their treatment of this a-effect. These models 
include cyclic convection, in which a is positive in the unstable layer and 
negative in the tachocline or overshoot layer below; magnetostrophic waves, 
which imply a negative a; flux loops which correspond to a positive a; and 
unstable global-scale Rossby waves. ^ The various models can be summarized 
as follows [406]: 

• Overshoot layer models (OL dynamos), also called co-spat ial wave models, 
combined with a < 0, give the correct migration direction and thus are 
able to model the butterfly diagrams, but tend to give cycle periods that 
are too short. 

• Distributed wave models (IF dynamos) require an abrupt spatial change 
in diffusivity to excite dynamo waves. So far, strong toroidal flelds can be 
produced in these models. 

• Co-spatial transport models (CP dynamos) can describe the held migration 
in terms of density pumping or advection of the magnetic held but do not 
address the origin of the deep toroidal fleld. 

• Distributed transport models (BL dynamos) start from an entirely different 
position: not a dynamo wave but a conveyor belt mechanism is responsi- 
ble for the emergence and evolution of sunspots as seen in the butterfly 
diagram. These models can reproduce the conflnement of active regions 
to low latitudes and describe the migration patterns of sunspots; however, 
present models require an unrealistically low turbulent diffusivity. 

If we disregard the many problems with the details of the process, how- 
ever, we can nonetheless give a simple formal approximation to the solution 
suggested by Parker [390] that outlines the main features required to describe 

^ A Rossby wave is a standing wave that slowly drifts in a fluid layer in a rotating 
body. The Rossby wave results from an interplay of a thermal or pressure gradient 
that drives a convection cell, and the Coriolis force. A prominent example of a 
Rossby wave is the undulating polar jet in the terrestrial atmosphere that guides 
the pressure regimes and thus is responsible for the weather. A simple laboratory 
experiment to produce a plane Rossby wave uses a rotating cylindrical tank with 
a temperature gradient between the outer wall and the axis. 
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Fig. 6.30. Local Cartesian coordinate system at a 
point in the northern hemisphere of a spherical ro- 
tating body. Note that the coordinate system rotates 
with the body, and thus the unit vectors vary with 
time 



the dynamo and the solar cycle, following [101]. The starting point is (3.135). 
Since we shall deal with average quantities only, we shall omit all brackets 
indicating averages, and B and u are meant to be average quantities. In 
addition, we assume that viscosity is determined by the turbulent viscosity. 
Rearrangement of (3.135) then gives 

— = Vx(«xv) + Vx {aB) + . (6.28) 

We choose a Cartesian coordinate system fixed on the surface of the Sun as 
sketched in Fig. 6.30: the ?/-axis points radially outwards, the x-axis points 
in the toroidal direction (east-west direction), and the z-axis points in the 
direction of increasing latitude. 

To be in agreement with the observed butterfly diagram, we need an 
equatorwards-propagating wave, that is, a wave propagating in the negative 
2 )-direction. In addition, the solution should be symmetric with respect to the 
rotation axis. In a local Cartesian system, this implies d/dy = 0. 

The toroidal magnetic field is then simply ByCy^ while the poloidal field 
lies in the xz-plane. Since this field is solenoidal, it has zero divergence and 
can be written as the rotation of a scalar field A{x^z): 

B = By(x^ z)ey + V X [A(x, z)ey] . (6.29) 



The mean velocity field results from the differential rotation and thus has 
a component in the y-direction: v = Vy{x)ey. The velocity shear then is 
G = dvyjdx. Equation (6.28) can then be written as 

a D 

-^ = - a VM + /? , (6.30) 

where a is constant. The x- and z- components of (6.28) can be written as 



V X 



dA 

dt 



€y - aByBy - /3 V^ACy ) = 0 . 



(6.31) 



This equation can be solved is the following condition is satisfied: 



dA 

dt 



— ocBy -f- A . 



(6.32) 
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Thus, for a field of the form (6.29), the dynamo equation (6.28) is satisfied 
if A and By satisfy (6.30) and (6.32). Thus instead of the dynamo equation 
(6.28), we can solve the two equations (6.30) and (6.32) simultaneously. 

Equation (6.32) describes the evolution of the poloidal field. Without the 
a-term this would be a simple diffusion equation: any poloidal field would 
just diffuse away. The a-term works as a source, generating new poloidal field 
out of the turbulence, since, as discussed in Sect. 3.6, a is a measure of the 
turbulent motion. Equation (6.30) describes the evolution of the toroidal field. 
It has two sources: the first results from the velocity shear of the differential 
rotation, and the second stems from the turbulent motion (as evident from 
the a contained in this term), because just as the helical motion can twist 
the toroidal field to produce a poloidal field, it can also twist a poloidal field 
to produce a toroidal one. If the differential rotation is strong, this second 
term can be neglected and (6.30) reduces to 

dBy ^ dA _ 

^=G^+^V%. (6.33) 

This equation describes the ai? dynamo and is a reasonable approximation 
for the solar dynamo. If differential rotation were weak, the other source term 
in (6.30) would be dominant and we would get an dynamo. 

The ai? dynamo is thus defined by (6.32) and (6.33). These equations 
can be solved by wave- like solutions. The ansatz is 

A = Aq exp{ujt + ikz) and By = Bq exp{u;t -f ikz) . (6.34) 

Substituting in (6.32) and (6.33), we obtain 

{uj -1- f3k‘^)Ao = aBo and (a; -f /3bk^)Bo = —ikGAo (6.35) 

or, combined, 

(a; 4- = -ikaG , (6.36) 

which has the solution 

w = -/3A;2 ± (^) ■ (6.37) 

For maintenance of the dynamo process, the real part of w must be larger 
than zero: 3?(w) > 0. In addition, k is taken to be positive. The crucial term 
is the product aG, which gives the combined effects of helical motion and 
differential rotation. For aG > 0 and 5R(a;) > 0, we obtain from (6.37) 



kaG 



— 1 



kaG 



kaG 



(6.38) 
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We can now define a dynamo parameter 

|aG| 



Na = 



/?2F • 



(6.39) 



From (6.38), we see that the condition for dynamo growth is > 2. The 
eigenmodes of the marginally stable = 2) dynamo, 



A, By ~ exp ^ -i\l t + \kz 



(6.40) 



correspond to waves propagating in the positive 2 :-direction, that is, pole- 
wards. 

For aG < 0 ,we obtain from (6.37) 

w = _ i or KH = -/3A;2 + -y/^l^ . (6.41) 

Dynamo growth is again described by the dynamo parameter (6.39), but the 
marginally stable solutions now become 



A, By ~ exp 



k\aG\ 

2 



t + ikz 



(6.42) 



which correspond to an equatorwards-propagating wave. Thus for aG < 0, 
we obtain a solution of the dynamo equation that accounts for both the 
periodicity and the equatorwards propagation of the solar magnetic field. 

Nonetheless, we should be aware that these solutions have been obtained 
under simplified conditions. In particular, a, /?, and G are assumed to be 
constant, which is certainly not true under realistic conditions. In a realistic 
scenario, one would have to solve the dynamo equation in a finite region 
with suitable boundary conditions. In addition, we have assumed that the 
mean velocity u is purely in the toroidal direction. However, with a suitable 
velocity field, an equatorwards-propagating wave can be obtained even if 
aG>0 [102]. 



6.6.3 Stellar Activity 

Owing to its close proximity, the Sun is the only star whose surface we can 
see directly. Thus we are able to identify such features as sunspots, filaments, 
flares, and solar cycles, all indicating that the sun is a magnetically active star. 
Magnetic activity is not unique to the Sun; it is not even strong on the Sun. 
Other stars can show much stronger magnetic activity. Although spots and 
flares cannot be observed directly, the Ca II line, which is a strong indicator 
of the magnetic network on the Sun, can be studied in other stars, too. This 
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has led to the identification of magnetic activity in other stars [25,429]. Cyclic 
variations and also magnetically fiat stars could be identified. Since many of 
these stars have shorter star cycles than the Sun, it could be shown that the 
variations in the solar cycle are small compared with the variations in the 
magnetic cycles of comparable stars. The other active stars, therefore, are 
often used to infer information about long-term variations in solar activity 
[326]. 

A large number of articles on various aspects of solar and stellar activity 
can be found in [389,466,564]. 



6.7 Flares and Coronal Mass Ejections 

Flares and coronal mass ejections (CMEs) are violent manifestations of solar 
activity. Both are related to the solar magnetic field, sunspots, and filaments. 
The energy released in these processes had been stored in the field. 

The first record of a solar flare dates back to Carrington in 1859, who 
observed a sudden brightening of a sunspot in white light. After this “ex- 
plosion” , the sunspot structure had changed and about a day later a violent 
geomagnetic storm with strong auroral activity was observed. Although Car- 
rington himself noted “a swallow does not make a summer” , this was the first 
direct link between solar activity and its influence on Earth. Coronal mass 
ejections only have been observed since the early 1970 with the advance of 
space-borne coronographs. Today, one of the most controversial topics is the 
relation between flares and CMEs. 

Flares normally are associated with active regions and sunspots. Nonethe- 
less, even during sunspot minimum on a spotless Sun sporadic flare events 
of large magnitude can occur. Such flares observed during solar minimum in 
general are associated with erupting filaments rather than with sunspots and 
active regions. In addition, there tend to be fewer flares per solar rotation at 
times of the highest sunspot number during a solar maximum [302]. For a 
review on the magnetic nature of solar flares see [423]. 

In some cases, flares occur repeatedly in the same location and display 
similar spatial structures during the evolution of the active region. Such flares 
are called homologous [179,566]: only part of the energy stored in the mag- 
netic field is released and the overall topology of the field is retained. These 
flares are not necessarily small because the energy release depends on the 
available magnetic energy and magnetic stability. If enough energy is stored 
in the field even the release of only a relatively small fraction of it can lead 
to a considerable flare. 

Some major flares also can trigger flares in their neighbourhood or even in 
remote active regions. Such flares are called sympathetic, see e.g. [361,570]. 
Sympathetic coronal mass ejections also can be observed [362]. 
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6.7.1 Electromagnetic Radiation 

A flare is the result of a sudden violent outburst of energy, with energies of 
up to 10^^ J being released over a time period of some minutes. White-light 
flares, as the one observed by Carrington, are rare because even for the largest 
flares the brightness is less than 1% of the total luminosity of the photosphere. 
In certain frequency ranges, e.g. at the wings of the black body, the intensity 
of the electromagnetic radiation can increase by orders of magnitude during 
a flare. While the flare is defined as an outburst in electromagnetic radiation, 
often it also is associated with the emission of energetic particles (Sect. 7.2) 
and huge plasma clouds, the CMEs. 

The electromagnetic radiation released in a flare in different frequency 
ranges shows typical time profiles (see Fig. 6.31). These profiles can be used 
to define the phases of a flare during which distinct physical processes occur. 

In a large flare, the electromagnetic emission can be divided into three 
parts. In the preflare phase, also called the precursor, the flare site weakly 
brightens in soft X-rays and Ha. This phase lasts for some minutes; it is 
observed in very large flares only, and indicates a heating of the flare site. 
During the impulsive or flash phase, most of the flare energy is released and 
the harder parts of the electromagnetic spectrum, such as hard X-rays and 
7-rays, are most abundant. This phase lasts for a few minutes and can be 
followed by a gradual or extended phase during which emission mainly occurs 
in Ha and soft X-rays but microwave and radio emission also can continue. 
This latter phase can last for some tens of minutes, and occasionally even for 
a few hours. It is present in the larger events only. Note that most flares are 
rather small, consisting of an impulsive phase only. 

The impulsive phase is related to an impulsive energy release, probably 
reconnection, inside a closed magnetic field loop in the corona, heating the 
coronal plasma and accelerating particles. The heated plasma emits soft X- 
rays. Accelerated electrons generate microwaves and hard X-rays at the top of 
the loop or hard X-rays, 7-rays, UV emission, and part of the Ha emission at 
its footpoints, while accelerated ions generate 7-ray line emission. Streaming 
electrons also generate radio emission. Thus the electromagnetic radiation 
also provides information about the acceleration and propagation of particles 
accelerated in a flare. 

Soft X-Rays and Ha. In a solar flare most of the electromagnetic radiation 
is emitted as soft X-rays with wavelength between 0.1 and 10 nm. Soft X- 
rays originate as thermal emission in hot plasmas with temperatures of about 
10^ K. Most of the radiation is continuum emission; lines of highly ionized O, 
Ca, and Fe are also present. In a large flare, soft X-rays are emitted during 
all three phases; the strong increase at the beginning of the impulsive phase 
is related to an abrupt increase in the temperature at the flare site up to 
about 5 X 10^ K. The Ha emission also is thermal emission, its time profile 
closely follows the soft X-ray profile. 
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Fig. 6.31. Solar electromag- 
netic radiation during the dif- 
ferent phases of a flare and 
frequency spectrum of radio 
bursts, based on [278] 



Hard X-Rays. Hard X-rays are photons with energies between a few tens 
of kiloelectronvolts and a few hundred kiloelectronvolts, generated as brems- 
strahlung of electrons with slightly higher energies [279,304,535]. Only a very 
small amount of the total electron energy (about 1 out of 10^) is converted 
into hard X-rays. 

Microwaves. Solar microwave emission is generated by the same electron 
population as the hard X-rays, as can be deduced from the similarities in 
the time profiles [34,264,379]. As in hard X-rays, during the impulsive phase 
the emission often consists of individual spikes, the ‘elementary flare bursts’. 
Microwave emission is gyro-synchrotron radiation of electrons with energies 
between some 10 keV and some 100 keV [430,515]. When these mildly rel- 
ativistic particles gyrate in the coronal magnetic field (about 20-100 G), 
they emit radiation with frequencies between 10 and 100 times their gyro- 
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frequency. This dependence on the magnetic field strength has a remarkable 
consequence: as the magnetic field decreases with height, microwave emission 
at a certain frequency, say 17 GHz, is generated by electrons with energies 
above 200 keV in low lying fiares or small loops, while in a loop extending 
high into the corona an electron energy of more than 1 MeV is required [296] . 
While in the impulsive phase the elementary bursts give evidence for indi- 
vidual, isolated energy releases, in the gradual phase the microwave emission 
most probably is thermal emission. 

7-Rays, y-ray emission indicates the presence of energetic particles. The 
spectrum can be divided into three parts: (a) Bremsstrahlung of relativistic 
electrons and, to a lesser extent, the Doppler broadening of closely neighbored 
7 -ray lines leads to a 7 -ray continuum, (b) Nuclear radiation of excited CNO- 
nuclei leads to a 7 -ray line spectrum in the range 4 to 7 MeV. (c) Decaying 
pions lead to 7 -ray continuum emission above 25 MeV. The details of these 
mechanisms are described in [370,371,431] 

The most important 7 -ray lines are at 2.23 MeV and 4.43 MeV. The 2.23 
MeV line is due to neutron capture in the photosphere: '^He-nuclei decay in 
p/a or a/p interactions in the corona, emitting neutrons. These reactions 
require particle energies of at least 30 MeV/nucl. The neutrons can propa- 
gate independently of the coronal magnetic field. Elastic collisions decelerate 
neutrons penetrating into the denser regions of the chromosphere or photo- 
sphere. Eventually, the neutron is slowed down to thermal energies and can 
be captured by or ^He. Neutron capture by leads to the emission of a 
7 -quant. The 4.43 MeV line results from the transition of a nucleus from 
an excited into a lower state, with the excitation being either due to nuclear 
decay or inelastic collisions with energetic particles. 

Radio Emission. Electrons streaming through the coronal plasma excite 
Langmuir oscillations. Solar radio bursts are metric bursts; their wavelengths 
are in the meter range. In interplanetary space, radio bursts are kilometric 
bursts. The bursts are classified depending on their frequency drift [43,351]. 
The type I radio burst is a continuous radio emission from the Sun, basically 
the normal solar radio noise but enhanced during the late phase of the flare. 
The other four types of bursts can be divided in fast and slow drifting bursts 
or continua (see the upper panel in Fig. 6.31). 

The type III radio burst starts early in the impulsive phase and shows a 
fast drift towards lower frequencies. Since the frequency of a Langmuir oscil- 
lation depends on the density of the plasma (see (4.52) and example 17), the 
radial speed of the radio source can be determined from this frequency drift 
using a density model of the corona. The speed of the type III burst is about 
c/3, it is interpreted as a stream of electrons propagating along open field lines 
into interplanetary space. Impulsive peaks in the hard X-ray emission can be 
related to individual type III bursts, indicating individual energy releases. 
Occasionally, the drift of the type III burst is suddenly reversed, indicating 
electrons captured in a closed magnetic field loop: as the electrons propagate 
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upward, the burst shows the normal frequency drift which is reversed as the 
electrons propagate downward on the other leg of the loop. 

In the metric type II burst, the frequency drift is much slower, indicating 
a radial propagation speed of its source of about 1000 km/s. It is interpreted 
as evidence of a shock propagating through the corona. Nonetheless, it is not 
the shock itself that generates the type II burst but the shock-accelerated 
electrons. As these electrons stream away from the shock, they generate small, 
type Ill-like structures, giving the burst the appearance of a herringbone 
(herringbone burst) in the frequency time diagram with the type II as the 
backbone and the type III structures as fish-bones. The type II burst is split 
into two parallel frequency bands, interpreted as forward and reverse shocks. 

The metric type IV and V bursts are continuous emission directly fol- 
lowing the type II and type III bursts, respectively. The type IV burst is 
generated by gyro-synchrotron emission of electrons with energies of about 
100 keV. It consists of two components: a non-drifting part generated by elec- 
trons captured in closed magnetic field loops low in the corona, and a propa- 
gating type IV burst generated by electrons moving in the higher corona. The 
type V burst is a similar burst following the type III burst. But in contrast 
to the type IV burst it is stationary, showing no frequency drift. Most likely, 
it is radiation of the plasma itself. 

Kilometric radio bursts in interplanetary space are interpreted in the same 
way: type III bursts show a fast frequency drift, indicating electrons stream- 
ing along a magnetic field line. If the location of the radio source can be 
identified, this kilometric type III burst can be used to trace the shape of the 
interplanetary magnetic field line [282]. The kilometric type II burst gives 
evidence for a shock propagating through interplanetary space [83] . 

Solar Quakes Produced by Large Flares. An only recently discovered 
by-product of flares is a circular wave packet emanating from the flare site. 
This wave was first observed by the Michelson Doppler Imager (MDI) on 
board SOHO during the large flare of June 1996 [295]. This magneto-acoustic 
wave can be interpreted as a kind of solar quake, containing about four orders 
of magnitude more energy than the 1906 San Francisco earthquakes. The 
waves of this quake were similar to surface waves on a pond produced by a 
stone. The waves accelerated from 10 km/s to about 115 km/s during their 
outward propagation until they finally disappeared in the photosphere. 

These solar quakes should not be confused with Moreton waves. The lat- 
ter are the chromospheric component, seen in Ho; radiation, of a solar-flare- 
induced wave that propagates away from the flare site at a roughly constant 
speed of about 1000 km/s. Moreton waves are attributed to fast-mode MHD 
shocks generated in the impulsive phase of a flare. 

6.7.2 Classes of Flares 

Not all the phases indicated in Fig. 6.31 can be observed in all flares. Instead, 
flares differ in their electromagnetic radiation, in the acceleration of energetic 
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Table 6.2. Classes of solar flares 



Impulsive Gradual 



Duration of soft X-rays 
Decay constant of soft X-rays 
Height in corona 

Volume 10^® 

Energy density 
Size in Ho; 

Duration of hard X-rays 
Duration of microwaves 
Metric type II burst 
Metric type III burst 
Metric type IV burst 
Coronal mass ejection 



< I h 


>I h 


[387] 


< 10 min 


>10 min [104,387] 


< 10^ km 


5 • 10"* km 


[387] 


10^’’ cm® 10^® - 


- 10®® cm® 


[387] 


high 


low 


[387] 


small 


large 


[22] 


<10 min 


>10 min 


[383] 


<5 min 


>5 min 


[119] 


75% 


always 


[84] 


always 


50% 


[84] 


rare 


always 


[84,292] 


rare 


always 


[84] 



particles, and the association with a coronal mass ejection. Standard classi- 
fications based on the magnetic structure and the energy release are given 
in [271,435,542]. 

A useful, although frequently modified, classification scheme for solar 
flares goes back to Pallavicini et al. [387] who used Skylab soft X-ray im- 
ages of the Sun, combined with intensity-time profiles. If a flare is observed 
on the solar limb, the height profile of the electromagnetic emission can be 
inferred. These limb flares can be divided into three distinct groups: (a) point- 
like flares, (b) flares in small and compact loop structures, and (c) flares in 
large systems of more diffuse loops. Flares of classes (a) and (b) are associated 
with a short duration of the soft X-ray emission, less than one hour, while in 
flares of class (c) the soft X-ray emission can last for some hours. Therefore, 
the compact and point-like flares are called impulsive, and the flares in the 
large diffuse loop are called gradual flares. 

The classification scheme, originally introduced for the soft X-rays only, 
over the years has been extended to other ranges of electromagnetic radiation, 
as summarized in Table 6.2. These schemes do not always agree. On the basis 
of the times scales, a flare might appear gradual in soft X-rays but impulsive 
in hard X-rays or vice versa. These phenomenological criteria provide no 
sharp separation into two classes but rather a continuous transition from 
more impulsive to more gradual flares. A better criterion, also pointing to 
the physical difference, is the occurrence of a coronal mass ejection, leading 
to an unambiguous classification of flares into confined (corresponding to 
impulsive) and eruptive (corresponding to gradual). 

We have to be careful not to confuse the classes of flares, i.e. impulsive 
and gradual, with the phases of flares bearing the same name [23] . An impul- 
sive flare appears to be rather simple in so far as it always has an impulsive 
phase. However, in some small events, which are observed in interplanetary 
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space as so-called ^He-rich events [239], even the impulsive phase is rather 
small with the Ha flare often too small to be detected although hard X-ray 
and/or radio emission is observed [437]. Some large impulsive events can also 
show a small gradual phase. In the larger flares (longer duration, larger volu- 
mina) the situation is even more complicated. While the gradual phase is well 
developed, an impulsive phase is not always present, only the largest gradual 
flares show all three phases. The gradual flares with a gradual phase only 
in general are small in electromagnetic emission, their main characteristic is 
the coronal mass ejection. These solar events are often called disappearing 
filaments because the expelled matter is their main signature while classic 
flare emission is weak or absent. 

6.7.3 Coronal Mass Ejections 

With the aid of a coronograph, the corona can be observed continuously. 
Basically, a coronograph is a telescope with an occulter screening oflF the 
direct photospheric emission. Ground-based coronographs have been in use 
since the 1930s, observing only selected coronal emission lines. Space-based 
coronographs, on the other hand, observe the light scattered by the corona. 
The first space-based coronograph was used on Skylab in 1973/74; the most 
advanced coronograph is on SOHO [63,161]. While the older coronographs 
had a field of view from a height of about 1.5 Tq out to 5 or 10 tq, the 
combination of different telescopes in the LASCO coronograph on SOHO has 
a field of view from 1.1 Tq out to about 30 Tq. In addition, its resolution 
is much better, thus smaller and fainter mass ejections have been detected. 
Examples and further details can be found at sohowww.nascom.nasa.gov/ 
gallery/LASCO/ or lasco-www.nrl.navy.mil/lasco.html. 

The most striking feature visible in a record of coronograph images is 
the coronal mass ejection (CME). A coronal mass ejection is a bright struc- 
ture propagating outward through the corona, as shown in Fig. 6.32. Large 
data bases on CMEs exist from the Solwind- Coronograph on P78, the HAO- 
coronograph on Solar-Maximum Mission (SMM), and the LASCO corono- 
graph on SOHO. The basic features of CMEs can be summarized as fol- 
lows [69,236,246,369,501]: 

Solar Cycle Dependence. During solar maxima, about two CMEs are 
observed daily, whereas during solar minima one CME is observed per week 
[551]. This is not too surprising because CMEs are related to flares and 
filaments which both are more frequent during solar maximum. With the 
better spatial resolution of the LASCO coronograph these numbers increase, 
however, the ratio between solar maximum and minimum does not change. 
Latitude Distribution. Coronal mass ejections are distributed evenly on 
both hemispheres, the average latitude is 1.5°N. Their distribution is flat 
within ±30° and decreases fast towards higher latitudes. The maximum in 
the ±30° region reflects the latitudinal distribution of sunspots and flares. 
During solar minimum, the CMEs cluster within ±10° around the equator. 
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Width. The projected widths of CMEs show a distribution with an average 
at 46° and a median at 42°. CMEs smaller than 20° or larger than 60° are 
rare; however, the largest angular extent is more than 120°. The width of 
CMEs seems to be independent of the solar cycle. 

Speeds. CME speeds range from less than 10 km/s up to greater than 
2000 km/s with an average of 350 km/s and a median at 285 km/s. The 
speeds of CMEs do not depend on the solar cycle. Fast and slow CMEs seem 
to reveal different patterns of energy release: a slow CME can accelerate in 
the coronograph’s field of view, indicating a continuous energy release. A fast 
CME, on the other hand, does not show evidence for acceleration. Here the 
energy release appears to be more explosive. 

Kinetics. During a CME between 2 x 10^^ g and 4 x 10^^ g coronal material 
is ejected with a kinetic energy content between 10^^ J and 6 x 10^^ J. This is 
comparable with the energy released as electromagnetic radiation in a flare. 
The combined potential and kinetic energy of the CME is at least comparable 
with the entire energy released in a flare. 

Structure. Textbook coronal mass ejection are loop-like structures as shown 
in Fig. 6.32. While these are the most impressive and also the most energetic 
CMEs, other morphologies exist, too, with such picturesque names as spikes, 
multiple-spikes, clouds, fans, or streamer blow-outs [236]. The basic difference 
is a smaller extent and a structure distinct from a closed loop. One example 
will be discussed in connection with Fig. 6.36. 

Note that all the geometrical quantities discussed above are apparent 
quantities only: while the coronal mass ejection is a three-dimensional struc- 
ture, its image is only a two-dimensional projection into a plane perpendicular 
to the Sun-Earth axis. Thus sizes and speeds might be underestimated. In 
particular, if the CME propagates directly towards the observer, its extent 
and speed cannot be determined. It is even more difficult to detect because it 
only becomes visible as a halo around the Sun after it has spread far enough. 




Fig. 6.32. Image of the 4 May 1986 CME, 
taken by the HAO coronograph. The dark 
disk in the lower left corner is the oc- 
culting disk inside the coronograph, and 
the dashed circle gives the photosphere. 
The arrow in the center of the Sun points 
towards the north. Note that this CME 
is observed during solar minimum condi- 
tions. Reprinted from S.W. Kahler and 
A.J. Hundhausen [263], J. Geophys. Res. 
97, Copyright 1992, American Geophysi- 
cal Union 
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Owing to this projection effect, the three-dimensional structure of a CME 
is debatable: it is not clear whether the leading bright structure is a loop 
or a bubble. Here observations from two spacecraft from different positions 
are required, such as planned with the STEREO mission. For details of the 
mission see stp.gsfc.nasa.gov/missions/stereo/stereo.htm, details of 
the coronograph can be found at wwwsolar.nrl.navy.mil/STEREO/index. 
html. 

6.7.4 Coronal Mass Ejections, Flares, and Coronal Shocks 

The speeds of coronal mass ejections are highly variable. In Fig. 6.33 the 
range of CME speeds is compared with the Alfven and the sound speed. 
In the corona, most CMEs are too slow to drive a fast MHD shock wave. 
Nonetheless, because many of the CMEs are still faster than the sound speed, 
they might drive slow or intermediate MHD shocks. In at least one CME the 
curvature of the loop suggests a slow shock [248]. 

In Sect. 6.7.1 we have learned about the metric type II burst, interpreted 
as a shock wave propagating through the corona. The relationship between 
type II bursts and CMEs is ambiguous. From a statistical study of CMEs and 
metric type II bursts [476], it is evident that about two-thirds of the metric 
type II bursts are accompanied by a fast CME. However, there are also metric 
type II bursts without CMEs (one-third) as well as CMEs without metric type 
II burst (three-fifth). In particular, half of the CMEs without type II bursts 
are fast CMEs, with speeds above 450 km/s. The situation becomes even 
worse if CMEs and type II radio bursts are compared in individual events. 
In some events, the CME’s radial speed is markedly lower than the speed 




. (ABOVE PHOTOSPHERE) 



Fig. 6.33. Alfven and 
sound speed in the corona 
and in interplanetary space 
compared with the solar 
wind speed vsw and speeds 
of coronal mass ejections 
(CMEs), coronal type II 
bursts, and interplanetary 
type II bursts. Reprinted 
from J.-L. Bougeret [57], 
in Collisionless shocks in 
the heliosphere: reviews of 
current research (eds. B.T. 
Tsurutahi and R.G. Stone), 
Copyright 1985, American 
Geophysical Union 
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of the radio burst; in other events the source of the metric type II emission 
is located behind the CME, occasionally overtaking it at later times. Thus 
a CME is neither a necessary nor a sufficient condition for a coronal shock. 
Nonetheless, the more energetic CMEs most likely drive a coronal shock. 

The relationship between flares and coronal mass ejections also is a hotly 
debated topic. Traditionally, the CME has been viewed as a phenomenon 
accompanying large flares. Today, it is suggested that the CME is the primary 
energy release while the flare is just a secondary process [200]. Both models 
have their pros and cons, currently we probably should follow the suggestion 
in [152] that the flare neither is the cause nor the consequence of the CME 
but that both are triggered by a common mechanism, probably an instability 
(see below). The observations leading to this statement are as follows. 

Flares and CMEs can occur together; however, both also can occur sepa- 
rately: about 90% of the flares are not accompanied by a CME, while about 
60% of the CMEs go without a flare. The combined flare and CME events 
are the most energetic events in both groups. In these events the flare, which 
is small compared with the angular extent of the CME, is not necessarily 
centered under the CME but more likely is shifted towards one of its legs. 

The prime argument for the CME being the cause and the flare the con- 
sequence is based on energetics: the energy released in the CME is larger 
than the one released in the flare. But the mechanism of the energy release is 
different, too: if a CME is accompanied by a flare, it has a high and constant 
speed, indicative of an explosive energy release. A CME without flare, on the 
contrary, often accelerates, indicating that energy is released continuously. 

Timing is another crucial factor in this discussion: in about 65% of the 
combined CME/flare events, the CME leads, while in the other 35% the flare 
starts before the CME. 

Combined, all these observations suggest that it might be difficult to view 
flares and CMEs in terms of cause and consequence and they favour a picture 
of a common trigger. 

6.7.5 Models of Coronal Mass Ejections (CMEs) 

Many CMEs originate in filaments, and the magnetic field pattern of the 
filament can even be recognized if the CME is detected in interplanetary 
space, see, for example, [56]. 

So far we have learned that magnetic pressure prevents the filament from 
“falling down” to the photosphere. But how and why can it suddenly be 
blown out so violently to form a CME? A detailed analysis of the magnetic 
field pattern of the filament and the photosphere reveals two different config- 
urations. In the normal configuration (Kippenhahn and Schliiter [286], K-S 
configuration), the magnetic field inside the filament has the same direction 
as the photospheric field below it (left panel in Fig. 6.34). We have already 
used this model in example 13. The inverse configuration (Raadu and Ku- 
perus [427,428], R-K configuration), which is shown in the right panel, is 
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Fig. 6.34. Magnetic field configura- 
tions in solar filaments: (left) normal 
configuration, also called K-S config- 
uration, (right) inverse configuration, 
also called R-K configuration 



more complex: here the magnetic field inside the filament is directed oppo- 
site to the one in the photosphere. In particular, in the large, high rising 
filaments, which tend to give rise to CMEs, the R-K configuration seems to 
be dominant. 

The crucial feature is obvious in the R-K configuration: below the filament 
there is an X-point configuration where magnetic fields of opposite polarity 
can be found in close proximity. Such a location is favorable for reconnection. 
The configuration might have been stable for a long time; however, the motion 
of the magnetic field lines anchoring the filament or a slow rise of the filament 
due to increased buoyancy can lead to the sudden onset of reconnection at 
the X-point. Then the magnetic field energy is converted into thermal energy 
and flow energy, leading to a further rise of the filament. At neighboring X- 
points, reconnection sets in, too, ripping off the filament from its anchoring 
structure and blowing it out as a CME. A filament of the K-S type can be 
expelled by the same mechanism, only the forces acting on the anchoring field 
lines must be larger to create a X-point configuration below the filament. 

In the classical model of a filament, reconnection always takes place be- 
tween the two legs of each field line. The resulting field configuration therefore 
is a closed loop below the filament and a toroidal field line around it. A dif- 
ferent situation arises if the legs of neighboring field lines merge, as sketched 
in Fig. 6.35. As the filament (gray area) lifts owing to some instability, recon- 
nection sets in. Since neighboring anchoring field lines reconnect, the filament 
is surrounded by a helical magnetic field. The plasma in front of the erupting 
structure is compressed. As it flows towards the trailing edge of the erup- 
tion, vortices are generated behind the arcade core. The vortices drive the 
plasma inwards and compress the current sheet below the filament. Therefore 
the current density increases to a value where kinetic plasma instabilities are 
excited. The increased resistivity leads to a higher magnetic diffusivity and 
a new reconnection line below the filament, leading to the formation of a 
secondary plasmoid below the original filament. 

In this three-dimensional reconnection [202,325], open field lines extend- 
ing into interplanetary space can also be involved. Thus, particles accelerated 
in the reconnection region can easily escape into interplanetary space. In ad- 
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Fig. 6.35. 3-D reconnection below an idealized filament. Figure taken from B. 
Vrsnak [542], in Lectures on solar physics (eds. H.M. Antia, A. Bhatnagar, and P. 
Ulmschneider), Copyright 2003, Springer- Verlag 



dition, the three-dimensional reconnection might evolve along the filament 
more slowly, leading a slow CME and a more continuous energy release. 

The expulsion of a filament is suitable for explaining the loop-shaped 
CMEs. Other types of CMEs, however, require different geometries. One ex- 
ample is shown in Fig. 6.36: reconnection in the tip of a helmet streamer. Its 
outer portion consists of opposing magnetic fields. As somewhere on the Sun 
new magnetic flux emerges, the coronal magnetic field is deformed and at the 
current sheet reconnection sets in, blowing out an open magnetic field struc- 
ture along the streamer. In the coronograph image, such a CME is seen as a 
spike- or jet-like structure. Although operating on a larger scale, this mech- 
anism is the same as in the blobs of high-density slow solar wind discussed 
above. 

6.7.6 Models of Flares 

We can expand the above model of a loop-like CME to accommodate a flare. 
By definition, this would be an eruptive or gradual flare. Figure 6.37 shows 
again a filament in the inverse configuration. As reconnection sets in at the X- 
point, three different phenomena occur: (a) The plasma is heated, leading to 
thermal emission in the soft X-ray and visible ranges, (b) Particles are acceler- 
ated, either streaming upward along field lines extending into interplanetary 
space (solar energetic particles. Sect. 7.2) or downward producing the hard 
electromagnetic radiation, (c) The filament breaks loose and is ejected as a 
CME. If the CME is fast enough, it drives a shock wave. Particles accelerated 
at this shock escape into interplanetary space along open held lines. Because 
the reconnection occurs on all anchoring held lines, the flare occupies a large 
volume which extends rather high into the corona, thus fulfilling the criteria 
of a gradual flare as summarized in Table 6.2. 

Large confined flares might be explained by the same model, the only 
difference would be in point (c): although reconnection sets in at the X- 
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Fig. 6.36. Disconnection event: a 
CME with an open magnetic field 
structure is expelled along a helmet 
streamer. Reprinted from D.J. Mc- 
Cormas [343], Geophys. Res. Lett. 
18, Copyright 1991, American Geo- 
physical Union 



point of one or a few anchoring fieldlines, the energy release is not large 
enough to break all connections between filament and photosphere. Thus 
the filament is not ejected and the flare is confined to a smaller volume. 
Such a configuration in which the filament lifts but fails to detach and the 
overall magnetic topology is retained is a suitable candidate for a series of 
homologous flares. 

In the point-like flares, which also are confined or impulsive flares, the 
observations of energetic particles in interplanetary space (Sect. 7.2) suggest 
a different scenario, as sketched in Fig. 6.38. Particles are accelerated inside 
a closed loop, giving rise to electromagnetic emission. The loop is very stable, 
preventing particles from escaping into interplanetary space. As the particles 
bounce back and forth in the closed loop, they excite electromagnetic waves 
which can propagate in all directions, interacting with the ambient plasma, 
even accelerating particles. If these “secondary” particles are accelerated on 
open field lines, they can escape into interplanetary space. Since the acceler- 
ation requires particles and waves to be in resonance, diflFerent particles are 
accelerated by different types of waves. If a particle species is common in 
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Fig. 6.37. Simplified model of a 
large eruptive flare. The energy re- 
lease occurs in the reconnection re- 
gion below the filament, leading to 
heating, particle acceleration, and a 
CME. The heated plasma and the 
energetic particles give rise to elec- 
tromagnetic emission in various fre- 
quency ranges 



the corona, such as H and ^He, the waves in resonance with these particles 
are absorbed more or less immediately; thus these particles predominately 
are accelerated inside the closed loop and therefore do not escape into inter- 
planetary space. Other waves, however, travel larger distances before being 
absorbed by the minor constituents, such as ^He and the heavy ions, and 
are thus more likely to accelerate these species on open field lines. Since the 
escaping particle component is selectively enriched in these minor species, 
this acceleration process is called selective heating. 




Fig. 6.38. Model of an impulsive ^He-rich flare. 
Particles accelerated in a closed loop excite waves 
which in turn accelerate particles on open field 
lines. These latter can escape into interplanetary 
space 
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6.7.7 Magnetic Clouds: CMEs in Interplanetary Space 

Coronal mass ejections in interplanetary space still carry the magnetic field 
patterns from their parent filament. These closed magnetic field structures, 
also called magnetic clouds, have features different from the ambient medium. 

Figure 6.39 shows the magnetic field and plasma data for a typical mag- 
netic cloud. Vertical lines indicate the boundaries of the cloud and the shock 
wave driven by it. The typical signatures of a magnetic cloud can be sum- 
marized as follows: (a) a decrease in magnetic field strength inside the cloud, 
(b) a rotation of the magnetic field vector, in particular its elevation, (c) de- 
creases in plasma density, plasma speed, plasma temperature, and therefore 
plasma-/?, and (d) a bi-directional streaming of suprathermal electrons back 
and forth along the length of the cloud (not shown in the figure). 

The magnetic field configuration of such clouds can be inferred from the 
variation in magnetic field elevation: at the beginning of the cloud the mag- 
netic field is almost perpendicular to the plane of ecliptic. Inside the cloud the 
elevation decreases until at the end the magnetic field vector is almost oppo- 
site to the one at the beginning. This is indicative of a magnetic field wrapped 
around the ejecta as sketched in Fig. 6.40. In this picture the magnetic cloud 
is sketched as a bundle of twisted magnetic field lines. The direction of field 
rotation varies from cloud to cloud, refiecting the field configuration of the 
parent filament. Just how long the magnetic cloud stays connected to the Sun 
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Fig. 6.39. Field and plasma data 
for an interplanetary shock and the 
magnetic cloud driving the shock. 
From top to bottom: magnetic field 
flux density, elevation, azimuth, so- 
lar wind speed, plasma density, and 
proton plasma temperature. The 
vertical lines indicate the shock 
and the boundaries of the mag- 
netic cloud. From L.F. Burlaga [70], 
in Physics of the inner heliosphere, 
vol. II (eds. R. Schwenn and E. 
Marsch), Copyright 1991, Springer- 
Verlag, Berlin 
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Fig. 6.40. Proposed 
topology of a magnetic 
cloud in interplanetary 
space. Reprinted from 
L.F. Burlaga [70], in 
Physics of the inner 
heliosphere, voL II (eds. 
R. Schwenn and E. 
Marsch), Copyright 
1991, Springer- Verlag 



is still an open question, so therefore the dashed lines in Fig.6.40 indicate the 
possibility but not the necessity of such a continued connection. 

Magnetic clouds are the main cause for geomagnetic disturbances, their 
geomagnetic effectiveness depends on whether the field at the leading edge 
has a strong northward or southward component (Sect. 8.5.2). 



6.7.8 Interplanetary Shocks 

In the event in Fig. 6.39 a shock has been observed in front of the magnetic 
cloud. But only about one-third of the CMEs in space drive an interplanetary 
shock [199], while apparently all travelling interplanetary shocks are driven by 
CMEs, although the magnetic cloud is not necessarily detected if the observer 
is located at the flank of the shock. Most shocks are observed around the solar 
maximum. 

Interplanetary shocks are identified by characteristic changes in the 
plasma and field parameters, in particular a sudden increase in plasma den- 
sity, speed, and temperature, and a jump in the magnetic field strength. 
Figure 6.41 shows two examples of shocks in the Helios data. From top to 
bottom, plasma temperature, plasma density, solar wind speed, and magnetic 
field strength are shown. Both examples can be identified best by the sudden 
jump in magnetic field strength. Here it is also obvious that each of the shocks 
consists of a forward shock (marked by a dashed line) and a reverse shock 
(marked by an arrow). Owing to the rather poor temporal resolution the 
shocks are more difficult to identify in plasma data. However, in the example 
on the right, the jumps in the plasma parameters at the forward shock are 
obvious. This is a fast, strong shock with a local shock speed of 1181 km/s. 
The example on the left, although the jump in magnetic field is by the same 
factor, is a slow, rather weak shock with a speed of 508 km/s, only slightly 
above the solar wind speed. 

The properties of interplanetary shocks are highly variable. Between 
0.3 AU and 1 AU, the basic characteristics are as follows: 




6.7 Flares and Coronal Mass Ejections 



195 




Fig. 6.41. Two examples for shocks observed by Helios 1. Vertical dashed lines 
mark the arrival of the forward shock; the arrows mark the reverse shock. Solar 
wind data from the MPAe Lindau experiment on board Helios, magnetic field data 
from the University of Braunschweig magnetometer on board Helios 



• The compression ratio varies between 1 and 8 with an average close to 2. 

• The magnetic compression (ratio between the upstream and downstream 
magnetic field strengths) varies between 1 and 7 with an average at 1.9. 

• Shock speeds in the laboratory frame vary between 300 km/s and 700 km/s 
with an average of about 600 km/s. Occasionally, shock speeds above 
2000 km/s can be observed. Obviously, shocks with speeds only slightly 
above 300 km/s can be observed in very slow solar wind streams only. 
Since the shock speed is lower towards the flanks, the tongue-like shape 
of the shock front closely resembles the shape of the leading edge of the 
magnetic cloud, as shown in Fig. 6.40. 

• The angular extent of the shock varies between a few tens of degrees and 
up to 180°; the shock is always wider than the driving CME. 

• The Alfven Mach number is between 1 and 13 with an average at 1.7. 

The shock parameters, of course, are related to the properties of the CME, 
such as speed, angular extent, and total energy released. 

An interplanetary shock is a disturbance propagating into the expanding 
solar wind. The shock should develop absolutely because it expands, and also 
relative to the ambient medium as the latter expands differently. In particular, 
the expansion of the shock leads to a decrease in the plasma and magnetic 
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Fig. 6.42. Radial variation of the shock 
speed from the Sun to 1 AU. If the CME is 
very fast close to the Sun, shock and CME 
slow down during propagation. This is gener- 
ally the case when the CME is accompanied 
by a flare. If the CME is slow close to the 
Sun, as in case of a disappearing filament, 
the propagation speed is roughly constant 



flux densities. Thus the energy density also decreases. But the latter decreases 
not only because of the shock’s expansion: turbulence created in the wake of 
the shock and particles accelerated at the shock front (Sect. 7.6) also reduce 
the shock’s energy. 

Changes in shock parameters with radial distance can be quite different 
from one shock to another. As an example, in Fig. 6.42 the radial variation 
of the shock speed is shown. Two extreme cases can be distinguished. If a 
shock is very fast close to the Sun (with CME speeds above 1000 km/s), 
it is likely to decelerate in interplanetary space. On the other hand, shocks 
that are rather slow on the Sun do not decelerate but propagate at roughly 
constant speed. Possible interpretations can be found in the energy release 
mechanism: it is more explosive in fast shocks and CMEs, which in general 
are also accompanied by a flare, compared with rather continuous in the 
slower ones. In addition, the faster shocks in general tend to be more efficient 
particle accelerators, and thus part of the shock’s kinetic energy is converted 
into kinetic energy of particles. In some sense this relates to the different 
speed characteristics of the CMEs and to the conversion of shock kinetic 
energy into particle energy. 



6.8 Shock Waves 

A shock is a discontinuity separating two different regimes in an otherwise 
continuous medium. It is associated with something moving faster than the 
signal speed in the medium: a shock front separates the Mach cone of a 
supersonic jet from the ambient, undisturbed air. Here the disturbance and 
the shock are moving, and thus the shock is called a travelling shock. Standing 
shocks also form: in a river, a shock forms in front of the bridge pier where 
the fast stream suddenly is slowed down. In space plasmas, both kinds of 
shocks exist: mass ejections propagating from the Sun through interplanetary 
space drive travelling shocks. The supersonic solar wind is slowed down at 
planetary magnetospheres, forming the bow shock, a standing shock wave. 
At these discontinuities the properties of the medium change dramatically. 
We can define a shock as follows: 
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1. The disturbance propagates with a speed faster than the signal speed. In 
a gas, the signal speed is the speed of sound; in space plasmas, it depends 
on the Alfven speed and the sound speed. 

2. At the shock front, the properties of the medium change abruptly. In a 
hydrodynamic shock, pressure and density increase; in a magnetohydro- 
dynamic shock, plasma density and magnetic field strength increase. 

3. Behind the shock front a transition back to the properties of the undis- 
turbed medium must occur. Behind a gas-dynamic shock, density and 
pressure decrease; behind a magnetohydrodynamic shock, plasma den- 
sity and magnetic field strength decrease. If this decrease is fast, a reverse 
shock develops. 

Shock waves can be stable for long times (in the solar system up to some 
months) and can propagate even out to the boundary of the solar system. 

While the study of gas-dynamic shocks started in the late nineteenth 
century and had its heyday in the 1940s, the study of plasma shocks started 
only in the fifties as an interest in fusion plasmas and the consequences of 
nuclear explosions in the atmosphere awoke. At that time, also a certain kind 
of shocks in a plasma has been detected that differed strongly from the gas- 
dynamic shock: in these collisionless shocks, densities are too low to allow for 
collision between individual atoms or molecules. Instead, the collective effects 
of the electrical and magnetic properties of the plasma allow for frequent 
interactions and the formation of a shock wave. 

Collisionless shocks therefore are different from gas-dynamic shocks. 
Nonetheless, the concepts about the fundamental nature of shocks are the 
same as in a gas-dynamic shock, as are the basic conservation laws. 



6.8.1 Information, Dissipation, and Non-linearity 

A shock is a non-linear wave of “permanent” form propagating faster than 
the signal speed. Thus an understanding of a shock has to invoke the concepts 
of information, dissipation, and non-linearity. 

Information can be transferred by a propagating disturbance; the sound 
wave is the simplest example. It can transfer information either as a contin- 
uous stream of different waves (as in language or music) or as a rather sharp 
pulse like the “bang” following an explosion or the “clap” in hand-clapping. 
Though the information contained in these latter signals might be more diffi- 
cult to decipher, they are more useful for the explanation of shock formation 
because they are wave parcels with a well-defined onset. These sounds travel 
as pressure pulses through the air. Their distance from the source defines the 
information horizon: inside the information horizon, the signal has already 
been received, outside it has not yet been detected. 

A sound wave is a compressional wave: the density increases with increas- 
ing pressure. Sound waves are “simple” waves. The compression is assumed 
to be adiabatic: the gas is compressed such that on expansion it returns to 
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its original state. This is possible because the compression is fast enough to 
prevent thermal conduction from removing heat. Thus the compression is 
isentropic, the entropy does not change. Furthermore, we assume that the 
disturbance is small and therefore viscosity, friction, and heat conduction 
are negligible, and that the gas behaves according to the ideal gas law. We 
then can calculate the sound speed according to (4.40). It is independent of 
frequency but depends on the gas parameters, for instance the temperature. 

The creation of a supersonic disturbance can then be viewed from different 
perspectives: we can ask ourselves whether information can travel faster than 
the speed of sound, we can study a disturbance moving faster than sound, or 
we can study a large-amplitude disturbance. 

Let us start with the latter case. A large-amplitude pressure pulse can be 
created by an explosion. In air, close to the site of the blast, the sound speed 
can increase up to about 1000 km/s. But the signal does not propagate as a 
harmonic wave: in the compressional phase the pressure amplitude can ex- 
ceed the atmospheric pressure. During decompression, however, the pressure 
cannot drop below zero. Thus the amplitudes of the positive and negative 
half-waves are different.^ In addition, in a large-amplitude wave the change 
in sound speed during compression and decompression becomes significant: 
during compression the temperature increases and the wave can propagate 
faster, while during the other half-wave the temperature decreases, leading 
to a slower propagation. Thus the wave front steepens in time, similar to a 
water wave running into shallow water. If the steepening eventually leads to 
a jump in density and pressure, a shock wave has formed. This kind of shock 
is called a blast wave shock. Type II radio burst in the solar corona probably 
indicate blast waves (Sect. 6.7.1). Blast wave shocks are often used in the 
numerical simulation of interplanetary shocks [138,245,494], although today 
it is realized that interplanetary shocks are driven shocks. 

A driven shock is associated with an object moving faster than a sound 
wave. In this case, even a small-amplitude disturbance can lead to the pres- 
sure and density jump that defines a shock. An object moving through air 
transfers momentum and energy to the ambient molecules. The motion of 
the object also requires motion of the air: molecules in front of the object 
must give way to it by streaming around the object and again collecting be- 
hind it. But to do this, the molecules must first receive information about 
the approaching object. The pressure pulse in front of a subsonic vehicle 
provides this information. For a supersonic vehicle, the information also has 
to be provided, even if only for a short distance ahead. Thus the medium 
has to be changed to allow for the faster propagation of information. As the 

^ The same argument holds in an even more expressive example, the shallow water 
wave which steepens until it finally breaks and spills over. Although the water 
wave as a surface wave requires a slightly different description it should be kept 
in mind as an illustrations because it is more likely to be related to everyday 
experience than an acoustical blast wave shock. 
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motion starts to become supersonic, the initially small-amplitude pressure 
pulses pile up in front of the vehicle, leading to a large- amplitude distur- 
bance. This large- amplitude disturbance is able to change the properties of 
the medium irreversibly: after the disturbance has passed by, the medium 
will be in a diflPerent state. Since the disturbance will change the temperature 
of the medium it also will change the sound speed. 

6.8.2 The Shock’s Rest Frame 

The simplest description of a gas-dynamic shock uses the shock’s rest frame 
(see Fig. 6.43): gas with a speed larger than the signal speed is flowing into 
the shock from the upstream medium which so far has not received any 
information about the approaching shock. Since this side is not modifled by 
the shock, it is also called the low-entropy side. At the shock front, irreversible 
processes lead to the compression of the gas and a change in speed: across the 
shock front, mass conservation is required, and thus the amount of matter 
flowing through each element of the shock surface has to be constant. The 
flow out of the shock front into the downstream medium is subsonic and the 
density is increased. Thus we can define a shock as an entropy-increasing or 
irreversible wave that causes a transition from subsonic to supersonic flow. 

Theoretically, the disturbance driving the shock can propagate at any 
speed. Therefore, the shock’s propagation speed can increase without any 
limit. The Mach number M is defined as the ratio between the shock speed 
in the upstream medium and the sound speed. It is always determined in 
the rest frame of the shock. In the upstream medium, M is larger than 1, 
in the downstream medium it is smaller. Thus in the downstream medium 
the plasma leaves the shock with a speed smaller than the sound speed: any 
disturbance in the downstream medium can propagate away from the shock. 

We can now easily understand that a travelling shock and a standing 
shock are identical: in a travelling shock a supersonic disturbance propagates 
through the medium while in a standing shock the object is at rest and the 
flow is supersonic. Thus the’ difference between standing and travelling shocks 
depends on the frame of reference only; the systems are Galilean invariant. 
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Fig. 6.43. Frame of reference for the de- 
scription of a shock. The shock front is at 
rest, plasma flows with a high speed Uu 
from the upstream medium into the shock 
front and leaves it with a lower speed Ud 
into the downstream medium 
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6.8.3 Collisionless Shock Waves 

In a gas-dynamic shock, the important process is the collision between 
molecules: they establish a temperature distribution, temperatures of dif- 
ferent species are equalized, density and temperature fluctuations can prop- 
agate, and the viscous forces associated with them lead to dissipation. 

Space plasmas are rarefled, and thus collisions are rare. These shocks 
are called collisionless shocks. The lack of collisions has some implications, 
for instance: electrons and protons can have different temperatures, their 
distributions can be very different from a Maxwellian making the classical 
concept of temperature obsolete, the presence of a magnetic field might even 
lead to highly anisotropic particle distributions, and processes of dissipation 
involve complex interactions between particles and fields. 

Nonetheless, shocks are frequently observed in space plasmas. While the 
coupling between the particles due to collisions is negligible, the magnetic field 
acts as a coupling device, binding the particles together. We have already used 
this assumption in magnetohydrodynamics (MHD) where we have described 
a magnetized plasma by concepts such as pressure, density, and bulk velocity. 
These concepts also prove helpful in the description of the plasmas upstream 
and downstream of the shock. The details of the shock front and the plasma 
immediately around it, however, cannot be covered within this framework 
because MHD does not consider the motions of and the kinetic effects due 
to individual particles. While we are still far from understanding the details 
of these processes, observations indicate that the collective behavior of the 
plasma is mainly due to wave-particle interactions. Thus collisionless shocks 
are an example of a macroscopic flow phenomenon regulated by microscopic 
kinetic processes. A popular account can be found in [455], a discussion of 
laboratory experiments to produce collisionless shocks is given in [135]. 



6.8.4 Shock Conservation Laws 

Plasma properties in the upstream and the downstream media are different 
in parameters such as bulk flow speed tx, magnetic field B, plasma density 
and pressure p. The relationship between these two sets of parameters is 
established by basic conservation laws, the Rankine-Hugoniot equations. 

Rankine-Hugoniot Equations in Ordinary Shocks. A very clear de- 
scription of the Rankine-Hugoniot equations and their application to hydro- 
and aerodynamic shocks is given in [108]. These equations describe the con- 
servation of mass, energy, and momentum through the shock front, and can 
even be applied to simple shocks in space plasmas if the distributions are 
isotropic Maxwellians and the magnetic field is roughly parallel to the flow. 

In these conservation laws the shock is assumed to be infinitesimally thin. 
In optics, a boundary is thin with respect to the wavelength and thick with 
respect to the spacing of the molecules in the crystal structure. Analogously, 
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in MHD a boundary is thin with respect to the scale length of the fluid 
parameters (if waves are involved, as in a shock, it is thin with respect to 
the wavelength) but thick with respect to the Debye length and the ion gyro- 
radius, both being characteristic for the collective behavior of the plasma. 

In the remainder of this section the abbreviation [X] = — X^ gives the 

difference of a quantity X in the upstream and the downstream media. The 
Rankine-Hugoniot equations for a gas-dynamic shock then are: 

• conservation of mass: 

[mUn] = [£>Un] = 0 ; (6.43) 

• conservation of momentum normal to the shock: 



[quI + p] = 0 ; 

• conservation of momentum tangential to the shock: 



= 0 ; 



• conservation of energy normal to the shock: 

.2 



QU 



2 ^7a 



7a 



P Un 



= 0 . 



(6.44) 



(6.45) 



(6.46) 



Here u is the flow speed, Un {ut) the flow speed normal (tangential) to the 
shock, Q the density, p the pressure, and 7a the specific heat ratio, all measured 
in the shock’s rest frame. The conservation of energy considers both kinetic 
flow energy and internal energy. Combining (6.43) and (6.44) yields [ut = 0]: 
the tangential component of the flow is continuous. Therefore, we can choose 
a coordinate system moving along the shock front with the speed Ut. In this 
normal incidence frame (see left panel in Fig. 6.44), u equals u^. 

The mass conservation (6.43) can be used to estimate the local shock 
speed. Making a Galilean transformation into the laboratory system, it can 
be written as [^(t^s — '^n)] = 0. Rearrangement gives the shock speed 



Qd'^n,d Qu'^n,u 
Qd ~~ Qu 



(6.47) 



In applying (6.47) to shocks in space plasmas, in particular travelling inter- 
planetary shocks, we should be aware of its limitations. First, the magnetic 
field is neglected; the shock is a simple gas-dynamic one. Second, the shock is 
assumed to be spherically symmetric with the flow perpendicular to the shock 
surface. For an interplanetary shock, this is an oversimplification [244, 472] 
and the speed estimated from (6.47) only gives the radial component of the 
shock speed. It therefore can be used as a lower limit only. However, observa- 
tions suggest that the deviation of the shock normal from the radial direction 
is often less than 20° [89], and thus (6.47) is a reasonable approximation. 
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Note that the shock speed alone is not indicative of the energetics of the 
shock. The shock speed becomes large if the denominator in (6.47) is small. 
Thus a small increase in density across the shock can often be associated 
with a high shock speed, while the total energy in terms of compression and 
mass motion is rather small. 

Example 21. For the shocks in Fig. 6.41, the following plasma parameters can 
be determined from the figure: for DOYs 217 and 268, we obtain upstream 
densites 30 cm~^ and 14 cm"^, downstream densities 60 cm~^ and 105 cm~^, 
upstream speeds 360 km/s and 602 km/s, and downstream speeds 420 km/s 
and 1101 km/s, respectively. From (6.47) we then obtain local shock speeds of 
480 km/s and 1204 km/s; the compression ratios Vn are 2 and 7.3, respectively. 
Both shock speeds are good approximations to the more accurate values given 
in the figure. □ 



Rankine-Hugoniot Equations in MHD Shocks. The crucial difference 
between a MHD shock and an ordinary shock is the magnetic field. Thus 
we have to expand the conservation laws to also accommodate the field. 
In addition, the geometry becomes more complex because the flow is not 
necessarily parallel to the field. 

Often a special rest frame is used, the de Hoffmann-Teller frame (right 
panel in Fig. 6.44). In the normal incidence frame, the upstream plasma flow 
is normal to the shock and oblique to the magnetic field. The downstream flow 
is oblique to both the magnetic field and shock normal. In the de Hoffmann- 
Teller frame [127], the plasma flow is parallel to the magnetic field on both 
sides of the shock and the u x B induction field in the shock front vanishes: 
the reference frame moves parallel to the shock front with the de Hoffmann- 
Teller speed vht x B = —E. 

For a MHD shock, the Rankine-Hugoniot relations can be inferred in the 
same way as in a gas-dynamic shock [58,68, 121]. With n being the unit 
vector along the shock normal, the Rankine-Hugoniot equations are: 

• the mass balance, which is the same as for the ordinary shock, 

[gu • n] = 0 ; (6.48) 
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Fig. 6.44. Frames of reference for MHD shocks: normal incidence frame (left) and 
de Hoffmann-Teller frame (right) 
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momentum balance, where the additional terms describe the magnetic pres- 
sure perpendicular and normal to the shock front, 



/ X / {B n)B 

Qu[u-n) + {p-\- - — n 

2/^0 / /^o 



= 0 ; 



(6.49) 



energy balance, where the additional terms describe the electromagnetic 
energy flux E x B/fiQ with the electric held expressed hy E = —v x B, 



u • n 



^ _L 
2 “^ 7-1 



7 B^ 

p+ — 
Mo 



{B -n)(B’ u) 

Mo 



= 0 ; 



(6.50) 



• Maxwell’s equations 

[J3 • n] = 0 , (6.51) 

which follows from VB = 0, and states that the normal component of the 
magnetic fleld is continuous (Bn = const), and 



[n X (u X J5)] = 0 , 



(6.52) 



which states that the tangential component of the electric fleld must be 
continuous. 

The Rankine-Hugoniot equations are a set of flve equations for the unknown 
quantities Bn, and Bt. 



6.8.5 Jump Conditions and Discontinuities 

The Rankine-Hugoniot equations allow the calculation of the downstream 
plasma parameters from the knowledge of the upstream parameters of a 
MHD shock. But these conservation equations are more general; the solutions 
of (6.48)-(6.52) are not necessarily shocks, instead a multitude of different 
discontinuities can be described, too. 

A contact discontinuity does not allow for a plasma flow across it, and thus 
it is Un = 0. It is associated with an arbitrary density jump while all other 
quantities remain unchanged. The magnetic fleld has a component normal to 
the discontinuity (Bn ^ 0), and thus the two sides of the discontinuity are 
not completely decoupled but tied together by the fleld such that they flow 
together at the same tangential speed Ut. 

A tangential discontinuity separates two plasma regions completely from 
each other. There is no flux across the boundary (un = 0 and Bn = 0) and 
the tangential components of both quantities change ([?/t] 7^ 0 and [Bt] 7^ 0). 
Plasma and fleld change arbitrarily across the boundary but a static pressure 
balance is maintained: [p + B^/2/xq] = 0. Tangential discontinuities thus are 
examples for pressure balanced structures. Typical changes in plasma and 
fleld parameters are sketched in the left panel in Fig. 6.45. 
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Fig. 6.45. Changes in magnetic field and plasma parameters across a tangential 
discontinuity (left) and a rotational discontinuity (right). Here ax indicates a change 
in the direction of the quantity x. ptot gives the total pressure, the sum of kinetic, 
plasma, and magnetic pressures, based on [36] 



A rotational discontinuity can be viewed as a large-amplitude wave. In an 
isotropic plasma, the field and the flow change direction but not magnitude. 
The rotational discontinuity requires pressure equilibrium according to (6.49). 
Because there is a flux across the boundary, we get Un 7 ^ 0 and 7 ^ 0. The 
normal flow speed is = B^/yJpLoQ and the change in tangential flow speed 
is related to the change in tangential magnetic held: [ut] = [Bt/-y/jl^]. Thus 
normal flow speed and the change in tangential flow speed are directly related 
to the Alfven speed and the change in tangential Alfven speed. The rotational 
discontinuity therefore is closely related to the transport of magnetic signals 
across the boundary. The jump conditions for a rotational discontinuity also 
apply at boundaries suitable for reconnection. Typical changes in plasma and 
field parameters are sketched in the right panel in Fig. 6.45. 

6.8.6 Shock Geometry 

One important parameter in the description of a MHD shock is the local 
geometry, i.e. the angle Obh between the magnetic field direction and the 
shock normal. Shocks can be classified according to ^Bn- 

• a perpendicular shock propagates perpendicular to the magnetic field: 
0Bn = 90°; 

• a parallel shock propagates parallel to the magnetic field: ^Bn = 0 °; 

• an oblique shock propagates at any Obh between 0 ° and 90° . Oblique shocks 
can be subdivided into 

• quasi-parallel shocks with 0° < Obu < 45° and 

• quasi-perpendicular shocks with 45° < Obu < 90°. 

The shock shown in Fig. 6.44 therefore is a quasi-parallel shock. 
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6.8.7 Fast and Slow Shocks 

A shock differs from the discontinuities described in Sect. 6.8.5 in so far 
as there is a flow of plasma through the surface (un 7 ^ 0 ) combined with 
compression and changes in flow speed. Note that in a parallel shock Bt equals 
0 and the magnetic field is unchanged by the shock, while in a perpendicular 
shock the normal component of the magnetic field vanishes, = 0 , and 
both plasma pressure and field strength increase at the shock. The parallel 
shock therefore behaves like a gas-dynamic shock - except for the fact that 
the collective behavior of the plasma is regulated by the magnetic field and 
not by collisions. 

In a plasma, different modes of MHD waves exist which can steepen to 
form a shock: fast, slow, and intermediate waves (see Fig. 4.4). Of these 
waves, only the fast and the slow waves are compressive. The intermediate 
wave is purely transverse with the velocity perturbation perpendicular to 
both ko and Bq. The intermediate shock, sometimes also called an Alfven 
shock, only exists in an anisotropic medium. In an isotropic plasma, such 
as the solar wind, it is not a shock but a rotational discontinuity: there is a 
rotation of the magnetic field by 180° in the plane of the shock but no density 
jump across the shock. Thus there is a flow across the boundary, but without 
compression or dissipation. In addition, the planes defined by the magnetic 
field and the plasma flow direction in the upstream and downstream media 
are not parallel but oblique. In an intermediate shock the propagation speed 
parallel to the magnetic field equals the Alfven speed, i.e. uint = uacos^Bh- 

Real shocks are formed by fast and slow magneto-sonic waves only. In 
both modes, the plasma density and pressure change across the shock. The 
phase speed of these modes is (see (4.49)) 

2vLt,siow = ± 1 / + v\y - 4vlvl cos 2 e , (6.53) 

with the + sign referring to the fast and the — sign to the slow mode. If 
these waves propagate perpendicular to then ?;inter = '^siow = 0 and 
Vfast = propagation parallel to B either t^fast equals t^inter 

for va > Vs or uinter equals Ugiow for va < Vs (see Fig. 4.4). For the different 
modes, different Mach numbers can be introduced: Ma is the Alfven Mach 
number, Mcs the sonic Mach number (the same as the Mach number in a sim- 
ple gas-dynamic shock), and Mgi and Mf the slow and fast Mach numbers, 
respectively. 

The change in the magnetic field is different in fast and slow shocks: in a 
fast shock the magnetic field increases and is bent away from shock normal 
because the normal component of the field is constant. The normal compo- 
nent of the upstream (downstream) flow speed is larger (smaller) than the 
propagation speed of a fast MHD wave and both upstream and downstream 
flow speeds exceed the Alfven speed. In a slow shock, the upstream speed 
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Fig. 6.46. Change in magnetic field di- 
rection across a fast and a slow MHD 
shock 



exceeds the sound speed but not the Alfven speed. In addition, the mag- 
netic field strength decreases across the shock and the field therefore is bent 
towards the shock normal (see Figs. 6.46 and 6.47). 

Travelling interplanetary shocks in general and planetary bow shocks in 
particular always are fast MHD shocks. So far, only a few slow shocks have 
been observed in situ in the solar system [90,441]. In the solar corona, how- 
ever, slow [248] and intermediate [502] shocks might be more common because 
both the Alfven and sound speed are much higher (see Fig. 6.33). 

For many aspects, in particular shock formation and particle acceleration 
at the shock, the crucial quantity is not the shock speed but its component 
^s|| sec ^Bn parallel to the magnetic field. A rather slow disturbance can 
still have a large propagation speed parallel to B if Obu is large enough. To 
form a shock, a disturbance must propagate with Ug|| > va to catch up with 
the waves propagating along the field, but it must not necessarily propagate 
with Vs > V A- This problem becomes evident in the definition of the Alfven 
Mach number. In a stationary frame of reference, the Alfven Mach number 
is defined as 

Ma = , (6.54) 

VA 

which is the shock speed in an upstream reference system relative to the 
Alfven speed. With this definition, even fast MHD shocks occasionally can 
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Fig. 6.47. Changes in magnetic field and plasma parameters across a fast (left) 
and a slow {right) shock. Based on [36] 
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have Mach numbers smaller than 1. A better definition, such as the critical 
Mach number 

Me = , (6.55) 

VA COS 6>Bn 

is formally more helpful because Me = 1 exactly gives the intermediate shocks 
while fast shocks always have Me > 1. But to determine the critical Mach 
number, the local geometry Obu must be known. 

6.8.8 The Coplanarity Theorem 

In Fig. 6.44 we have tacitly assumed that the shock normal and the magnetic 
field directions in the upstream and the downstream media all lie in the same 
plane. This assumption is called the coplanarity theorem and is a consequence 
of the jump conditions at the shock. It can be expressed as 

n • (Bd X Bu) = 0 . (6.56) 

If we only consider the transverse component, the momentum balance (6.49) 
for an isotropic pressure p can be written as 

QUnUt - = 0 . (6.57) 

L 2/io J 

Equation (6.52) can be written as [u^Bt — B^Ui] = 0. Therefore both [Bt] 
and [unBt] are parallel to [ut] and thus also parallel to each other. Then we 
have [unBt] x [unBt] = 0. Resolving the parentheses gives 

(Un,u - ^^n,d)(Bt,u X Bt,d) = 0 . (6.58) 

Since [i^n] does not vanish, the upstream and downstream tangential mag- 
netic components must be parallel to each other. Thus the upstream and 
downstream magnetic field vectors are coplanar with the shock normal vec- 
tor and the magnetic field across the shock has a two-dimensional geometry. 
The bulk velocity is coplanar with the shock, too. 

6.8.9 The Shock Normal Direction 

An application of the coplanarity theorem is the calculation of the shock nor- 
mal in observational data. If the shock normal is known, the angle 0Bn, which 
is crucial for shock formation and particle acceleration, can be calculated. 

If only magnetic field measurements are available, the coplanarity theorem 
(6.56) for the magnetic field can be used. Since the magnetic field is diver- 
genceless, we have (Bu - Bd) • n = 0. Thus together with (6.56) we have 
defined two vectors perpendicular to the shock normal. These vectors can be 
used to calculate the shock normal: 

(Bu X Bd) X (Bu - Bd) 

KBuXBd)x(Bu-Bd) 



(6.59) 
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This method does not work if is parallel to The shock normal derived 
according to (6.59) is called the coplanarity normal. If also three-dimensional 
plasma measurements are available, other constraints, in particular the copla- 
narity theorem for the bulk velocity, can be used to determine the shock nor- 
mal. In addition, different methods can be combined into one overdetermined 
solution and solved for a best-fit shock normal [453] . 

With the known shock normal we are also able to determine the shock 
speed Vs more accurately than suggested by (6.47): 



Vs = n 

Qd ~ Qn 



(6.60) 



Example 22. Let us briefly return to example 21. From the details of the 
magnetic held, a colleague has inferred the shock normal and the direction of 
the upstream and downstream plasma flows relative to it. For simplicity, the 
shock normal is given here as (1,0), with the x-component in the direction of 
shock propagation. For the shock of DOY 217, the direction of the upstream 
flow is (0.94, 0.34), and that of the downstream flow is (0.98, 0.17); the shock 
is therefore almost quasi-parallel as expected from the weak compression in 
the magnetic held. The shock speed then is 



V. = 



60x|0f^)x420-30 x(0;94jx 360 ^^ 



60-30 



km/s = 485 km/s , 
(6.61) 

which is slightly above the values of 480 km/s determined in example 21. □ 



6.9 What I Did Not Tell You 

In the previous chapters, we have basically dealt with the “well-defined” top- 
ics of physical concepts. In this chapter, we have encountered “real-world” 
physics in a complex environment. We have used our concepts to describe the 
observations; however, we should be aware that the basic concepts describe 
only the general features of the natural phenomena - to understand the de- 
tails and interpret the observations correctly, we need more advanced models 
(Chap. 12). 

But before we can turn to advanced models, we have to be aware of the 
limitations of our measurements. For instance, all in situ measurements suffer 
from one basic problem: interplanetary space is a three-dimensional medium 
which is highly variable in space and time. Thus, formally, all our variables 
are fields e{r,t) varying in time. What we observe is a time series of the 
parameter e at a varying position inside the held because, in general, both 
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the field (for instance, the magnetic field convected outwards with the solar 
wind) and the observer move. Many of the ideas described in this chapter are 
therefore based on sparse evidence. 

We have already mentioned one example: in the interpretation of magnetic 
field fiuctuations, we are not able to distinguish between waves and turbulence 
because the observed variations in plasma and magnetic-field parameters are 
a mixture of temporal and spatial variations. But a one-point observation is 
not only unable to resolve the nature of the local fiuctuations, it is also unable 
to resolve large-scale structures. For instance, the latitudinal distribution of 
the solar wind speed shown in Fig. 6.25 is not a snapshot but sampled over 
almost five years, because it took Ulysses that long to complete one orbit. 
In addition, the solar wind speeds are measured at radial distances between 
1.3 and 5 AU, again a consequence of Ulysses’ orbit. This sampling is not 
necessarily a disadvantage, because in this case all sampling occurred during 
a solar minimum and thus the figure can be interpreted as a measure for 
average solar minimum conditions in the intermediate heliosphere between 
1 and 5 AU. However, if we want to take a look at shorter time scales, 
the figure is of limited use because it ignores all the short-term variability. 
As a consequence, not only do we have to have our data but we also need 
metadata, that is all relevant information about the data, instrumentation 
and observational practice. 

In this chapter we have also learned about an ambiguous method, that 
of classification. Classification is useful because it helps us to focus on com- 
mon aspects of different events/individuals and thus prevents us from getting 
lost in details. But as long as we do not understand the physical differences 
between the classes of events, our classification scheme will be based on phe- 
nomenological criteria. Since the average height of males exceeds that of fe- 
males, it might be worthwhile to discuss such a classification approach with 
a 162 cm male and a 186 cm female to learn about its limitations. 

The phenomenological classification scheme has another disadvantage: 
with each new mission, it is at risk of becoming obsolete. For instance, in 
this chapter, the occurrence of a CME has been mentioned as a possible 
physical basis for the classification into impulsive and gradual flares. The 
observations with the LASCO coronograph, however, showed that there were 
more CMEs than believed previously and that many of the impulsive events 
were accompanied by CMEs too. Thus, taking the occurrence of CMEs as a 
criterion, we would get a different classification scheme, which would not be 
in agreement with the original classification schemes. But CMEs also show 
differences among themselves, in particular with respect to their spatial ex- 
tent, their speed, and their ability to drive a shock. A distinction of CMEs 
into different groups (again a classification) might help to support the origi- 
nal classification scheme, and thus it is still valid that I have introduced this 
accepted standard in this book - but you should be aware that this clas- 
sification is temporary, that it varies, and that, depending on the author’s 
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understanding, the terms “impulsive” and “gradual” are sometimes used with 
slightly different meanings. 



6.10 Summary 

The heliosphere is structured by the solar wind and the frozen-in magnetic 
field, which is wound up into Archimedian spirals due to the Sun’s rotation. 
The fast solar wind originates in coronal holes at the poles while the slow wind 
originates from the streamer belt close to the solar equator where sunspots, 
filaments, and active regions are located. Where fast and slow solar wind 
streams meet, corotating interaction regions form. Fluctuations on different 
scales are superimposed onto the average field. They are related to waves orig- 
inating in the corona, interactions between different solar wind streams, and 
transient disturbances such as magnetic clouds and travelling interplanetary 
shocks. The latter stem from coronal mass ejections, i.e. violent expulsions 
of huge plasma clouds. 



Exercises and Problems 

6 . 1 . Explain the basic conservation laws across a shock front. What are the 
differences between a gas-dynamic and a hydrodynamic shock? 

6 . 2 . Explain the differences between a fast and a slow shock. 

6 . 3 . Derive (6.14) for the length of the Archimedian spiral. 

6 . 4 . Consider a 10 MeV proton in interplanetary space. Determine its gyro- 
radius, its gyration period, and the wave numbers of the Alfven waves in 
resonance with the proton (assume three different pitch angles, 10°, 30°, and 
90°). Compare with the corresponding values for a 1 MeV electron. 

6 . 5 . For an observer on the Earth, calculate the length of the magnetic field 
line to the Sun and its longitude of origin (connection longitude). Do the 
same for an observer at 5 AU. (Assume a plane geometry with the field line 
confined to the plane of ecliptic.) 

6.6. Imagine a slow solar wind speed starting on the Sun. 30° east of this 
stream, a fast stream with twice the speed of the slow stream originates. 
Where would they meet? (Simple assumption of an Archimedian magnetic 
field spiral.) 

6 . 7 . An electron beam with c/3 propagates through the interplanetary plasma 
and excites a radio burst (see example 17). Assume a decrease in plasma 
density ~ 1 /r^ . Calculate the frequency drift under the assumption that the 
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electron beam propagates radially. How do these results change if the cur- 
vature of the Archimedian field line, along which the electrons propagate, is 
considered? How would the curvature of the field line infiuence the frequency 
drift of a type II burst in front of an interplanetary shock? 

6.8. A magnetic loop on the Sun has a parabolic shape with B = ^o(l + 

with H = SO 000 m being the height of the loop and s the distance 
from the top of the loop. Calculate the bounce period of particles with a speed 
of 2c/3. As the particles interact with the atmosphere at the mirror points, 
they create hard X-rays. What is the time interval between two subsequent 
elementary bursts? 

6 . 9 . The plasma instrument on an interplanetary spacecraft detects a sudden 
increase in plasma density. No other changes in plasma or field are observed. 
Is this a shock? 

6 . 10 . The plasma instrument on an interplanetary spacecraft detects a dis- 
continuity with a jump in plasma density from 4 cm“^ to 8 cm"^ and a 
jump in plasma fiow speed from 400 km/s to 700 km/s (all quantities in the 
spacecraft frame). Determine the shock speed. What is the meaning of this 
speed? 

6.11. Assume the following average solar wind properties at the Earth’s orbit: 
proton density 7 cm“^, electron density 7.5 cm“^, He^“^ density 0.25 cm“^, 
fiow speed 400 km/s almost radial, proton temperature 2 x 10^ K, electron 
temperature 1 x 10^ K, and magnetic field 7 nT. Calculate the fiux densities 
and the fiux through a sphere of radius 1 AU for the following quantities: 
protons, mass, radial momentum, kinetic energy, thermal energy, magnetic 
energy, and radial magnetic fiux. 
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The space between Heaven and Earth - is it not like a bellow? 

It is empty and yet not depleted; 
Move it and more always comes out. 

Lao-Tzu, Tao Te Ching 



Particles in interplanetary space come from sources as diverse as the Sun, 
the planets, and the vast ness of space. The properties of the different particle 
populations provide information about the acceleration mechanism(s) and 
the propagation between source and observer. Thus energetic particles can 
also be used as probes for the properties of the interplanetary medium. To 
describe acceleration and propagation, we use concepts such as reconnection, 
acceleration at shock waves, and wave-particle interactions. 

This chapter starts with an overview of the different particle populations 
and subsequently discusses some of them in detail, in particular solar ener- 
getic particles and their propagation, shock-acceleration, particles accelerated 
at travelling interplanetary shocks and planetary bow shocks, and galactic 
cosmic rays and their modulation. 



7.1 Particle Populations in the Heliosphere 

Energetic particles in interplanetary space are observed with energies rang- 
ing from the supra-thermal up to 10^^ eV. The main constituents are pro- 
tons, o;-particles, and electrons; heavier particles up to iron can be found in 
substantially smaller numbers. The particle populations originate in differ- 
ent sources, all having their typical energy spectrum, temporal development, 
and spatial extent. Figure 7.1 summarizes the particle populations. Table 7.1 
their properties, a recent review can be found in [435]. 

7.1.1 Populations and Sources 



(A) Galactic Cosmic Rays (GCR) are the high-energy background pop- 
ulation with energies extending up to 10^^ eV. They are incident upon the 
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(A) GALACTIC 
COSMIC RAYS 

+ 

(B) ANOMALOUS 
COMPONENT 
(FROM BEYOND 50AU) 




FORWARD 

SHOCK 



(C) FLARES IN SOLAR ATMOSPHERE 
( D) FLARE -INITIATED SHOCK WAVES (ESP -EVENTS ) 



REVERSE 

SHOCK 



(E) COROTATING INTERACTION REGION 
(COROTATING EVENTS) 

(F) PLANETARY MAGNETOSPHERES AND BOW SHOCKS 



Fig. 7.1. Populations of energetic charged particles in the inner heliosphere. 
Reprinted from H. Kunow et al. [305], in Physics of the inner heliosphere, vol 
II (eds. R. Schwenn and E. Marsch), Copyright 1991, Springer- Verlag 



heliosphere uniformly and isotropically. In the inner heliosphere, the galac- 
tic cosmic radiation is modulated by solar activity: the intensity of GCRs is 
highest during solar minimum and reduced under solar maximum conditions. 
For reviews see, for example, [47, 133, 160, 163, 268, 350,416, 552]. 

(B) The Anomalous Cosmic Rays (ACR), also called the anomalous 
component, energetically connect to the lower end of the GCRs but differ from 
them with respect to composition, charge states, spectrum, and variation 
with the solar cycle [153,154,157,257]. As neutral particles of the interstellar 
medium travel through interplanetary space towards the Sun, they become 
ionized. These charged particles are convected outward with the solar wind 
and accelerated at the termination shock. Then they propagate towards the 
inner heliosphere where they are detected as anomalous component. 

(C) Solar Energetic Particles (SEPs) are accelerated in solar flares, the 
injection of these particles into the heliosphere thus is point-like in space and 
time. SEP energies extend up to some tens or a few hundred megaelectron- 
volts, occasionally even into the gigaelectronvolt range [271,359]. The latter 
can be observed with neutron monitors on the ground, the event is called a 
ground-level event (GLE). Owing to interplanetary scattering, particle events 
in interplanetary space last between some hours and a few days, depending 
on the scattering conditions and the observer’s distance from the Sun. SEPs 
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Table 7.1. Characteristics of particle populations in interplanetary space. The 
letter in the first column is the same as in Fig. 7.1, the subsequent columns give 
the temporal and spatial scales as well as the typical energy range 



Temporal 

scales 


Spatial 

scales 


Energy range 


Acceleration 

mechanisms 


A continuous 


global 


GeV-> TeV 


diffusive shock 


B continuous 


global 


10-100 MeV 


shock? 


C 5 


S keV-100 MeV 


reconnection, stochastic, 
selective heating, shock 


D days 


extended 


keV-10 MeV 


diffusive shock, 
shock-drift, stochastic 


E 27 days large-scale 


keV-10 MeV 


diffusive shock 


F continuous 


local 


keV-MeV diffusive shock, shock drift 



not only provide information about the acceleration processes on the Sun 
but also can be used as probes for the magnetic structure of interplanetary 
space. Solar energetic particle events show different properties, depending 
on whether the parent flare is gradual or impulsive [84,271,272]. In gradual 
flares, the solar energetic particles mix with particles accelerated at inter- 
planetary shocks. Although SEPs originate in flares, only a small fraction of 
all flares leads to enhancements in energetic particles above background. 

(D) Energetic Storm Particles (ESPs) are particles accelerated at inter- 
planetary shocks. Originally, ESPs were thought to be particle enhancements 
related to the passage of an interplanetary shock. The name was chosen to re- 
flect their association with the magnetic storm observed as the shock hits the 
Earth’s magnetosphere. Today, we understand the acceleration of particles at 
the shock, their escape, and the subsequent propagation through interplane- 
tary space as a continuous process, lasting for days to weeks until the shock 
Anally stops accelerating particles. The properties of particles accelerated at 
interplanetary shocks are summarized for the tens and hundreds of kiloelec- 
tronvolt range in [440, 456, 462, 523, 557] and for the megaelectronvolt range 
in [83,85,267]. At very strong shocks, protons can be accelerated up to about 
100 MeV or more [434,528]. 

(E) Corotating Interaction Regions (CIR) also lead to intensity in- 
creases. Protons with energies up to about 10 MeV are accelerated at the 
CIR-shocks [32,159,314,358,490]. The energetic particles can even be ob- 
served remote from the corotating shocks at distances where the shocks have 
not yet been formed [305,573] or at higher solar latitudes when a spacecraft 
is above the streamer belt where the CIRs form [136,306,487]. 

(F) Particles accelerated at planetary bow shocks are a local particle com- 
ponent with energies extending up to some 10 keV [107]. An exception is the 
Jovian magnetosphere where electrons are accelerated up to about 10 MeV. 
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Energy (keV) 



Fig. 7.2. Energy spectra of different ion 
populations in the heliosphere above a 
postulated quiescent background, based 
on [188] 



With a suitable magnetic connection between Earth and Jupiter, these Jovian 
electrons can be observed even at Earth’s orbit [98, 106]. 

Figure 7.2 gives the energy ranges and relative intensities for different 
ion populations in the heliosphere. The figure is limited to energies below 
100 MeV; above that energy the higher energetic end of the SEPs can be 
found as well as the galactic and anomalous cosmic radiation. The spectrum 
of the GCR is shown in Fig. 7.29. 

7.1.2 Relation to the Solar Cycle 

Figure 7.3 shows, from top to bottom, sunspot numbers, intensities of pro- 
tons above 4 MeV, and intensities of protons above 60 MeV. The particle 
data were obtained by the University of Chicago instrument on board IMP. 
The lower proton energies are predominantly influenced by solar energetic- 
particle events; the higher proton energies reflect the temporal development 
of galactic cosmic rays, and only the individual events standing out above 
the background are of solar origin. 

The main characteristics of the solar-cycle dependence in the particle 
components are (letters as in Fig. 7.1 and Table 7.1): 

A OCRs have higher intensities during solar minima and lower intensities 
during solar maxima, as evident from the bottom panel of Fig. 7.3. 

B ACRs, which constitute a minority in the lower panel of Fig. 7.3, show the 
same temporal variation. 
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Fig. 7.3. Energetic particles during the solar cycle, IMP measurements; data from 
Space Physics Interactive Data Resource (SPIDR), spidr.ngdc.noaa.gov/spidr/ 
index . html 



C SEP events are more frequent during solar minima than during solar max- 
ima (middle panel and spikes in the lower panel): active regions and fila- 
ments are more frequent during solar maxima than during solar minima. 
However, SEP events are also observed during solar minima. 

D ESP events are also contained in the middle panel and show the same 
temporal variation as SEPs. 

E CIR events are more frequent during solar minima: a CIR is formed in the 
interaction of a fast and a slow solar wind stream. The formation of the 
CIR, the development of the shocks, and the particle acceleration all require 
time. During solar maximum conditions, the stream pattern is frequently 
disturbed by coronal mass ejections and magnetic clouds. 

F Particle acceleration at planetary bow shocks appears to be rather inde- 
pendent of the solar cycle. The temporal pattern in the observation of 
Jovian electrons, for instance, is regulated not by solar activity but by 
the magnetic connection between Jupiter and the observer. Of course, the 
highly disturbed medium between the source and the observer during solar 
maxima conditions makes detection more difficult. 

In the study of energetic particles, we are interested in their source, in the 
acceleration mechanisms, and in their propagation. Occasionally, the acceler- 
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ation mechanism can be studied in situ, for example at travelling and coro- 
tating shocks and at planetary bow shocks. For most populations, however, 
the observer is remote from the source: there is no access to the acceleration 
sites of solar energetic particles or galactic cosmic rays. Even particles accel- 
erated at shocks in interplanetary space can be observed remotely from the 
shock. In all these cases, interplanetary propagation has modified the parti- 
cle populations. Therefore the properties of a particle population observed 
in interplanetary space reflect in general the combined effects of acceleration 
and propagation. 



7.2 Solar Energetic Particles and Classes of Flares 

Particle increases at the Earth’s orbit as a consequence of solar flares first 
were detected in 1942 in neutron monitor records [164]. Because of the small 
spatial and temporal extent of the particle source, SEPs not only provide 
information about particle acceleration but also can be used to study inter- 
planetary propagation. 

Solar energetic particles are mainly protons, electrons, and a-particles 
with small admixtures of ^He-nuclei and heavier ions up to Fe. Protons and 
ions can be accelerated up to some tens or hundreds of MeV /nucl, occasion- 
ally even energies in the gigaelectronvolt range can be acquired. The electron 
acceleration is limited to energies of some megaelect ronvolts. The particle 
increase above background can vary between hardly detectable and some or- 
ders of magnitude (see Fig. 7.3). While the flare lasts only for a few minutes, 
maybe even an hour, a solar energetic particle event at the Earth’s orbit typ- 
ically lasts for a day or two, depending on the number of particles injected 
into space, scattering conditions, and the presence of a coronal mass ejection 
fast enough to drive an interplanetary shock. 

Solar energetic particle events observed in interplanetary space exhibit 
different features, depending on whether the parent flare was impulsive or 
gradual [84,272]; some of these features are summarized in Table 7.2. 

What do the differences in particle populations tell us and how can they 
be related to the flare scenarios in Sect. 6.7.6? First of all, we can infer that 
different acceleration mechanisms must be at work. The high charge states 
of Fe (20) and Si (14) in impulsive events are indicative of acceleration out 
of a very hot environment with temperatures of about ten million Kelvin 
[329]. The coronal temperature of about two million Kelvin would imply 
lower charge states of about 14 for Fe and 10 for Si, such as observed in 
gradual events. Thus in the impulsive flares, the plasma must be heated 
while in gradual flares particles can be accelerated out of the corona without 
significant heating. Particle events following impulsive flares are also enriched 
in some of the heavier ions, in particular an increase in ^He/^He, Fe/C, and 
Fe/0 is observed [438]. Particle events following gradual flares, on the other 
hand, show coronal abundances. 
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Table 7.2. Properties of energetic particle events following impulsive and gradual 
flares. The numbers refer to particles with energies of a few MeV /nucl 





®He-rich 


gradual 


Particles 


electron-rich 


proton-rich 


^He/‘*He 


~ 1 (enrichment 2000 times) 


- 0.0005 


Fe/0 


1.234 (enrichment 8 times) 


~ 0.155 


H/He 


10 


100 


Qpe 


^ H-20 


~ -f-14 


Duration 


hours 


days 


Longitudinal cone 


< 30° 


< 180° 


Metric radio bursts 


III, V 


II, III, IV, V 


Coronograph 


- 


CME 


Solar wind 


- 


ipl. shock 


Event rate/a 


- 1000 


~ 10 



These differences can be understood by the two flare models presented in 
Sect. 6.7.6. Impulsive flares can be understood in terms of selective heating, 
while in gradual flares particles are accelerated out of the cool coronal ma- 
terial by the shock in front of the CME. Since the shock accelerates ions, in 
particular H, more efficiently than electrons, impulsive flares are electron-rich 
compared with gradual flares and also show a lower H/He ratio. 

The shock in front of the CME has another consequence, namely the dif- 
ferent longitudinal cones in which energetic particles can be observed. In the 
model of selective heating, particles are accelerated on a few open held lines 
adjacent to the closed loop in which the flare occurred. Thus out of the held 
lines originating on the Sun, only a small bundle is filled with particles. But 
these particles do not spread in all directions. Since SEPs are charged parti- 
cles, they propagate along the magnetic field lines but not perpendicular to 
them. Thus the particles stay on their field lines and an observer in inter- 
planetary space only detects particles if she is on one of these field lines, i.e. 
if she is magnetically connected to the source region. Owing to the curvature 
of the interplanetary magnetic held line, under normal solar wind conditions 
an observer on Earth is magnetically connected to a position at 60°W. Thus 
for impulsive events, particles are detected on a spacecraft around the orbit 
of Earth only when the flare occurred in the western hemisphere of the Sun. 

In a gradual flare, the situation is different. The shock in front of the CME 
has a large angular extent. Thus, the particles are not accelerated locally but 
over a wide region. Therefore, particles are injected into a much larger cone of 
held lines open to interplanetary space and the particle events therefore can 
be detected at larger angular distances from the site of the energy release. 
Thus, close to the Earth, particle events from gradual flares can also be 
detected if they originated in the eastern hemisphere of the Sun. 
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So far for a gradual flare we have only considered the particles accelerated 
at the shock. But in the model of a large flare in Fig. 6.37, the CME and 
the coronal shock are only one part of the event. The processes occurring 
below the filament still closely resemble those in a flare without CME. The 
characteristics of the particles interacting on the Sun can be inferred from 
the 7 -ray line spectrum, and their temporal evolution also from the hard X- 
ray emission. Corresponding observations indicate that the primary energy 
release must be quite similar in impulsive and gradual events. In particu- 
lar, although the electromagnetic emission in gradual flares lasts for a longer 
time, the elementary energy release in both classes of flares occurs in bursts 
that are similar. In addition, the compositions of the ions interacting on the 
Sun are similar and both are indicative of some process similar to selective 
heating. If the observer is magnetically connected close to the flare site in a 
gradual event, these flare- accelerated particles can also be detected in inter- 
planetary space. The upper panel of Fig. 7.4 shows the temporal evolution 
of the intensities of protons, O, and Fe in a gradual flare. In the lower panel 
the Fe/0 ratio is shown for different energies. Early in the event, the Fe/0 
ratio is high, as expected for an impulsive event. With time, the Fe/0 ratio 
decreases to a value typical of gradual events or the solar wind plasma. Thus 
early in the event, flare- accelerated particles are present while with increasing 
time the particles accelerated at the shock become more and more abundant. 
Note also the differences in the time profiles: the high-energetic Fe channel 
rises rather fast, and the intensity decreases after its maximum, i.e. a few 
hours after the flare. This closely resembles a diffusive profile as expected for 
a short injection on the Sun. In H and O, on the other hand, the intensity 
stays roughly constant or even increases towards the shock because particles 
are continuously accelerated and injected from the shock as it propagates 
towards the observer. 

We should be aware that in most cases a classification scheme is used to 
order a complex data set rather than to reflect physical processes. Nonethe- 
less, the two classes described in Table 7.2 are distinct kinds of events. But 
whether there really are only these two classes, or whether the two classes re- 
present extreme cases of a more continuous change from impulsive to gradual, 
is still under debate. In particular, there are large impulsive flares which are 
not accompanied by a CME but show characteristics which are in between 
those given in Table 7.2, while on the other hand there are disappearing- 
fllament events where the particles exhibit features typical of events follow- 
ing gradual flares, although no flare is observed. In addition, even if a flare 
is accompanied by a coronal mass ejection, the latter does not necessarily 
drive a shock. This latter point might become more interesting in the future, 
as observations with the more sensitive SOHO coronograph show that small, 
slow CMEs are relatively common and can also be observed in rather small 
impulsive flares. Here the role of the CME is probably not the acceleration 
of additional particles, because these CMEs tend to be small and slow, but 
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Fig. 7.4. Particle intensities and the development of particle composition in a 
gradual flare. Reprinted from D.V. Reames et al. [436], Astrophys. J. 357, Copyright 
1990, American Astronomical Society 



rather the opening of the flare loop in which the initial energy release oc- 
curs, allowing the particles to escape into interplanetary space. If the latter 
scenario turns out to be true, the particles observed in space should exhibit 
properties in between those given in Table 7.2: although the charge states 
should be high, the selective enrichment should be diminished because the 
accelerated particles can escape before the mechanism of selective heating has 
developed fully. A more detailed discussion of the above classification scheme 
and the two classes of particle events can be found in [435]; a critical review 
of that scheme and a discussion of the issues mentioned in this paragraph is 
given in [271]. 



7.3 Interplanetary Transport - Theoretical Background 

From the long time scales (several hours to some days) of solar particle 
events compared with the flare duration, in the 1950s Meyer et al. [356] 
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suggested that interplanetary transport might be diffusive. In this section 
we shall discuss the basic concepts of diffusion, their physical foundation in 
particle scattering at magnetic inhomogeneities (resonance scattering), and 
different transport equations. 

7.3.1 Spatial Diffusion 

Diffusion is the consequence of frequent, stochastically distributed collisions; 
thus it is a stochastic process. Therefore, it is not reasonable to discuss the 
motion of individual particles; instead one has to consider an assembly of 
particles, described by the distribution function. 

But diffusion is not only spatial diffusion. If we carefully drip a drop of 
ink into a glass of water, in time the drop will spread and eventually ink 
and water will be mixed completely: the thermal motion leads to collisions 
between ink and water molecules, distributing both species uniformly. This 
is spatial diffusion. If we carry out the same experiment with a good drop of 
cold cream and a cup of steaming hot tea, we find a second consequence of 
the collisions: after some time, tea and milk have the same temperature, thus 
thermal energy is transferred from the faster molecules to the slower ones, 
leading to diffusion in momentum space. The existence of both processes has 
already been mentioned in connection with Fig. 5.3. 

Let us start with spatial diffusion alone. All particles have the same speed 
and collisions lead to changes in the direction of motion only. To describe 
the effect of diffusion, we have to keep track of a larger number of small 
spatial steps for a large number of particles. Because the stochastic aspect is 
important, we can borrow some concepts from probability calculus and use 
simple games with coins as illustrations. 

Tumbling Drunkards and Tossed Coins. Spatial diffusion, or more cor- 
rectly, the motion of a particle in spatial diffusion, occasionally is called 
“drunkards walk” . To get the picture, imagine a couple of drunkards, happily 
lingering around a distiller. As they hear a police siren in the distance, they 
start to stagger away, everyone in his own direction. They all make steps 
of equal length but random direction. As a police helicopter arrives at the 
scene, every drunkard had made N steps of length A. The spatial distribution 
of the drunkards, as seen from the helicopter, is shown in Fig. 7.5. None of 
the drunkards has covered the maximum possible distance NX. Instead, they 
are still relatively close to the distiller. How close, compared with the max- 
imum distance, can be described by a quantity called the expected distance 
or, mathematically more correct, the average squared distance. This average 
displacement is \y/N as indicated by the circle. 

Let us now reduce the problem to the one-dimensional case: the test 
objects can only move along a straight line, again with constant step length. 
We can simulate the resulting motion by flipping a coin: a head leads to a 
step in the positive direction, a tail to a step in the negative one. Let us 
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Fig. 7.5. Distribution of drunkards staggering away from a 
distiller D. The circle indicates the average expected distance 

xVn 



consider one particle only. At first glance one might expect the expected 
distance to be close to the starting point. In particular, after a large number 
of tosses, we would expect the number of heads and tails to be roughly equal 
and therefore the net displacement to be small. This, however, is a faulty 
reversion of the law of large numbers, which is often observed in people 
gambling only occasionally: if the coin has shown tails 9 times in succession, 
the chance of heads in the next toss is exactly the same as in all previous 
tosses, 50%, because the coin does not remember the results from the last 
tosses. Thus in a long series of tosses, there can be quite a large deviation from 
a deadlock between heads and tails. This has been known since the middle of 
the seventeenth century when game theory was quite popular. Thus if for a 
long time one side of the coin can be dominant, as indicated in Fig. 7.6, then 
a large net gain for the one and a large loss for the other gambler results. 
Or, in case of one-dimensional motion, the displacement from the starting 
position can become quite large. 

The average squared distance (Ax)^, or the expected distance for short, 
can be determined easily. The total squared displacement of the particle is 
the sum of the displacements dxi in each individual step: 



rlO 



■--lO 





Fig. 7.6. Gain and 
loss chart for tossing 
coins. In the upper 
panel, the result of 
100 tosses is shown, in 
the lower panel 10 000 
tosses have been con- 
sidered 
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Fig. 7.7. Gallon board: many small 
scatters work together stochastically, 
forming a bell curve (Gauss’s distri- 
bution) 



( N \ ^ N N 

^ dx^ I = (dxi 4 - dx 2 + ... + dxiv)^ XI X] • C^-^) 

i=l / 2=1 j=l 

The individual displacements dx^ are either +A or — A, both with a probability 
of 0.5. Thus the product dx^dx^ is either A^ or — A^. For i ^ j, dx^ and dx^ 
are independent and both positive and negative values of the product have 
the probability 0.5. In the sum (7.1) these terms cancel and only products 
with i = j remain. They are always +A^ and there are N such products. 
Equation (7.1) then becomes 



{Axf = NX^ . (7.2) 

Thus with increasing number N of steps, the average displacement from the 
starting point increases as \/]V. 

If the particle has a speed v, the total distance s travelled during a time t is 
s — vt. li N is the number of direction reversals during this time interval, the 
distance also can be written as s = A^A. Therefore in (7.2) we can substitute 
N by vt/X: 

{Axf = ATA^ = vXt = 2Dt . (7.3) 

Here D is the diffusion coefficient: 

D = lvX. (7.4) 

Note that this diffusion coefficient has been defined for one-dimensional mo- 
tion. For three-dimensional motion, the diffusion coefficient becomes 

D = lvX. (7.5) 

Galton Board and Bell Curve (Gauss’s Distribution). The average 
distance is a statistical term which refers to a large assembly of particles. The 
individual particles scatter around the starting point. Their distribution can 
be described by the bell curve (Gauss’s distribution). 
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Fig. 7.8. Broadening of a Gauss dis- 
tribution with increasing standard de- 
viation. Physically, this is equivalent to 
the diffusive broadening of a distribu- 
tion with time 



The Galton board is a graphical way to derive this distribution. It consists 
of rows of pins, indicated by dots in Fig. 7.7, and models the scattering the 
particles experience: as a ball hits a pin, it is deflected either to the left or to 
the right. Then it hits a pin in the next row, which leads to another deflection 
and so on. The solid line indicates a sample path. Below the lowest row, the 
particles are collected in slots. The slot in which a ball finally comes to rest, 
results from a large number (equal to the number of rows) of stochastic 
interactions of comparable strength. If we use a large number of balls, the 
distribution in the slots will be a bell curve or Gauss’s distribution: 



Here xq is the average and a is the standard deviation. P{x) describes the 
probability of a ball to be found in the slot at position x. The standard 
deviation a defines the width of the distribution: 68.3% of all values will be 
inside the interval [xq — cr, xq + cr] and 95.4% inside [xq — 2cr, Xq + 2cr]. It is 
given as 

cr^ = I ^(x - xo)^ =: (Ax)2 , (7.7) 

and therefore describes the widening of the particle distribution or the ex- 
pected displacement from the origin. 

We can rewrite (7.6) and (7.7) to find an expression depending on the 
diffusion coefficient. With (7.3) and (7.4) we find for the standard deviation 



a = yj (Ax)^ = yj2Dt = \/v\t , 
and therefore for the bell curve 



P{x) = 



1 

\/2'KV\t 



exp 



(X - Xq)^ 

2vXt 



(7.8) 



(7.9) 



Note that the maximum stays fixed while the distribution broadens with 
time, as described by (7.3) and shown in Fig. 7.8. 
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The Diffusion Equation. If collisions happen in a homogeneous gas en- 
closed in a fixed volume, the relevant quantity to describe the diffusive pro- 
cess is the mean free path. It does not make sense to talk about a diffusion 
coefficient or an expected displacement because, viewed from the outside, 
the collisions do not change the properties of the gas, only the individual 
molecules change positions. This could be depicted by something similar to 
the Galton board: while in the Galton board the pins are arranged to form 
a triangle with the input only at the tip of the triangle, the modified board 
would consist of pins arranged in a rectangle with the input all over the top 
line. For each input slot, the spatial distribution is the same as for a Galton 
board. But the superposition of all the different input slots leads to the same 
number of particles in each output slot. For a gas this implies that on average 
for each particle leaving a volume element another one enters. 

The situation is different if there is a gradient. Then there are more par- 
ticles of the species under study in one part of the volume than in the other. 
Accordingly, a random walk carries more particles out of the volume with 
high density than particles are carried in from the lower density region. Thus 
a net transport results, reducing the gradient and eventually leading to an 
equalized distribution. The streaming S of particles can be described as 

S = -DVf/ , (7.10) 

with D being the diffusion tensor for anisotropic diffusion and U the particle 
density. The gradient is the driving force for the fiow, a larger gradient leading 
to a larger fiow. The flow also depends on the mobility of the particles, 
described by the diffusion tensor. If diffusion is isotropic, the diffusion tensor 
reduces to the diffusion coefficient and the streaming becomes S = —DVU. 
Since the diffusion coefficient depends on the particle speed and on the mean 
free path, for a given gradient the flow as well as the average displacement are 
largest for fast particles undergoing only few collisions (thus having a large 
mean free path) and smallest for slow particles undergoing many collisions. 

The equation of continuity (3.35), gives the following for a volume element 

^+^Sdo = 0. (7.11) 

0 ( 5 ) 

Here N is the number of particles and S is the flux of particles through the 
surface o of the volume element V. If U is the particle density, (7.11) yields 

^ j Ud^x+ j) Sdo = 0. (7.12) 

V 0(V) 



With Gauss’ theorem (A. 33) this is 
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With (7.10) we can write the diffusion equation as 



f =V.(DVI/). 



(7.14) 



If the diffusion is independent of the direction (isotropic diffusion), we can 
use the diffusion coefficient (7.4) instead of the diffusion tensor and get 



— = V-(£>VC). (7.15) 

If the diffusion coefficient is also independent of the spatial coordinate, as for 
example in a homogeneous medium, the equation can be reduced further: 

^ = DAU . (7.16) 

We have already encountered the one-dimensional form of this equation in 
Sect. 3.4.2 in Eqs. (3.106) and (3.107). 



Solutions of the Diffusion Equation. The solution of the diffusion equa- 
tion depends on the boundary conditions. In the general case we shall con- 
sider propagation from the source at a position vq. Thus we have to consider 
a source Q in the diffusion equation: 



dt 



DAU = Q{ro,t). 



For a spherical symmetric geometry this can be written as 



dt r‘^ dr \ ^ dr 



Q{ro,t) 



(7.17) 



(7.18) 



with Dr being the radial diffusion coefficient, which describes the diffusion 
between different radial shells [380,381]. 

The simplest case is a pulse-like injection of Nq particles at the position 
ro = 0 at time to = 0. A typical example is the injection of solar flare particles 
into the interplanetary medium. The solution of the diffusion equation for a 
radial-symmetric geometry then reads 



U{r,t) 



No 



ADrt 



(7.19) 



Two typical diffusive profiles are shown in Fig. 7.9. The intensity rises 
fast to a maximum and than decays slowly ~ The time of maximum 

tin can be determined by setting the first temporal derivative to zero: 




(7.20) 
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Fig. 7.9. Typical diffusive 
profiles for small and large As 



The time of the maximum decreases with increasing mean free path and in- 
creasing particle speed. That is what we expect from our experience with 
gases and liquids: the diffusion of a minor constituent is faster with increas- 
ing temperature (corresponding to a higher particle speed) and decreasing 
density (corresponding to an increase in particle mean free path). The time 
of the maximum increases quadratically with increasing distance. This can 
be understood easily from (7.8): the average distance increases with \/t. 

Graphically, the time to maximum can be interpreted as follows [560]: if 
we write (7.20) in the form = {r/2X) {r/v), we have r/v as the direct travel 
time for the distance r and can interpret r/2A as a measure of the number 
of mean free paths between the origin and the observer at r. The quantity 
r/2A therefore characterizes the delay due to diffusion. 

Inserting (7.20) into (7.19) gives the density at the time of maximum: 



U{r,tm) = 



No /_3 

(47rr2/6)^ ^ \ 2 



No 



(7.21) 



The intensity at the time of the maximum thus decreases with increasing 
radial distance but it is independent of the diffusion coefficient. 

Equation (7.20) is used frequently as a simple estimate of the radial mean 
free path from the time of maximum of a particle profile observed in inter- 
planetary space. Rewriting (7.20) we obtain 



Ar 




(7.22) 



Solutions of the diffusion equation so far have been obtained for the 
spheric symmetric case, assuming that particles propagate radially from one 
shell at r to the next one at r -h Ar. The mean free path Ar then refers to 
the radial mean free path. In interplanetary space, the geometry is different: 
particles propagate along the magnetic field line, thus it is more reasonable 
to use a particle mean free path Ay parallel to the magnetic field line. In 
addition, the field is not radial but Archimedian. The solution, however, is 
identical to the radial one as long as the relation 
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Ar = A|| cos^ or Dj: — D|| cos^ '0 (7.23) 

is obeyed [380,381]. Here is the spiral angle between the radial direction 
and the Archimedian magnetic field line. Note that here it is assumed that 
diffusion perpendicular to the magnetic field is negligible [158,256,432]. 



Diffusion-Convection Equation. So far, we have considered particles be- 
ing scattered in a medium at rest. A different situation arises if the medium 
is also moving, for instance an oil spill in a river: the oil is distributed by 
diffusion but is also carried along with the moving fiuid. In interplanetary 
space, the convection is due to the solar wind: the particles are scattered at 
inhomogeneities frozen into the solar wind and therefore propagating together 
with the solar wind. Thus the streaming in (7.10) has to be supplemented by 
the convective streaming Sconv = Uu, giving S — Uu — DVU. Here u is the 
velocity of the convective fiow. 

As above, the streaming can be inserted into the equation of continuity 
(3.34), giving the diffusion-convection equation 

— + V(Cu) = V(DVC) . (7.24) 

If u and D are independent of the spatial coordinate, (7.24) reads 

Or T 

+ uVU = DAU . (7.25) 

In the radial-symmetric case, the solution for a (J-injection then is 



U{r,t) 






-y {inDrty 



: exp < - 



(r — uty 



(7.26) 



For small bulk speeds u of the medium the transport equation as well as its 
solution converge towards the simple diffusion equation. 



7.3.2 Pitch Angle Diffusion 

Thus far we have not discussed the physical process of scattering. In the 
graphical description we have tacitly assumed that we are dealing with large- 
angle interactions: either the particle continues to propagate in its original 
direction of motion or it is turned around by 180°. In chapter 5, however, 
we have learned that fast particles in a plasma are more likely to encounter 
small-angle interactions. Thus to turn a particle around, a large number of 
interactions is required. 

In space plasmas small-angle interactions are not due to Coulomb-scatter- 
ing in the electric field of the background plasma but due to scattering at 
plasma waves. The physical processes will be briefiy sketched in Sect. 7.3.4; 
the formal description is similar to the one discussed for spatial diffusion. 
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Let us assume a magnetized plasma with an energy density exceeding that 
of the energetic particles: the energetic particles then can be regarded as test 
particles. Thus the particles gyrate around the lines of force and a pitch angle 
can be assigned to each particle, often written in the form /i = cos a. Each 
interaction leads to a small change in /i, i.e. a diffusion in pitch angle space. 
We can derive a scattering term strictly analogous to spatial diffusion. Let 
us start from (7.14). This can be rewritten easily: while the driving force 
for spatial diffusion is a spatial gradient, the driving force for pitch angle 
diffusion is a gradient in pitch angle space. Therefore the spatial derivatives 
have to be replaced by a derivative to /i and the scattering term reads 

I ('‘<'*>1) ' 

with K, being the pitch angle diffusion coefficient and / the phase space den- 
sity. Note that the scattering depends on /x, and thus the scattering can be 
different for different pitch angles, depending on the waves available for wave- 
particle interaction. The pitch angle diffusion coefficient can be related to the 
particle mean free path parallel to the magnetic field [215,256] by 



Here A|| does not describe the average distance travelled between two consec- 
utive small-angle scatterings, but the distance travelled before the particles 
pitch angle has been changed by 90°, i.e. the direction of motion has been 
reversed. Thus for the overall motion, A has a meaning comparable to the 
mean free path in ordinary spatial diffusion. 

The term (7.27) also can be used to describe spatial scattering if we 
also consider the field-parallel motion fiv of the particles. Thus as in the 
diffusion-convection equation we have to consider the streaming of particles 
with respect to the scattering centers. The transport equation then can be 
written as 



df df d f , ,df 



(7.29) 



Here df/ds is the spatial gradient along the magnetic field line. This depen- 
dence is sufficient, because the motion of the guiding center is one-dimensional 
along the magnetic field line and the particle gyrates around it. We will en- 
counter this equation again as part of the focused transport equation (7.36) 
for particles in interplanetary space. 



7.3.3 Diffusion in Momentum Space 

Collisions not only change a particle’s direction of motion but also its energy. 
We had already mentioned this as a basic requirement in the establishment of 
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a thermal distribution. Momentum transfer can happen by collisions between 
particles as well as by wave-particle interaction. If the energy gain in each 
interaction is small compared with the particle energy, this can be described 
as diffusion in momentum space [167,527]. Instead of particle flow, streaming 
Sp in momentum results in 

Here Dpp is the diffusion coefficient in momentum space. The second term 
describes non-diffusive changes in momentum, such as ionization, and corre- 
sponds to the convective term in the spatial diffusion equation. It therefore 
can also be described as convection in momentum space. Again, the physics 
of the scattering process is hidden in Dpp. 



7.3.4 Wave— Particle Interactions 

In this section we shall briefly introduce some of the basic processes of 
wave-particle interactions. While in all previous sections the plasma was 
regarded as well-behaved, wave-particle interactions are an example of the 
non-linearity of plasma physics. While for the linear aspects treated before a 
well-developed mathematical description is available, in the non-linear theory 
no general algorithms exist. Only few analytical methods are known, most 
of them relying on approximations. One of them is the limitation to lowest- 
order perturbations, similar to the approximation described in Sect. 4.2. Let 
us start this section with a brief introduction to quasi-linear theory. 

Quasi-Linear Theory. Quasi-linear theory is based on perturbation theory; 
interactions between waves and particles are considered to first order only. 
Thus all terms of second order in the disturbance should be small enough to be 
ignored. Only weakly turbulent wave-particle interactions can be treated this 
way: the particle distribution is only weakly affected by the self-excited waves 
in a random-phase uncorrelated way. This requirement not only corresponds 
to small disturbances in perturbation theory but even directly results from 
it as the waves are described in the framework of perturbation theory. The 
waves generated by the particles will affect the particles in a way which will 
tend to reduce the waves. Thus we assume the plasma to be a self-stabilizing 
system: neither indefinite wave growth happens nor are the particles trapped 
in a wave well. 

The basic equation is the Vlasov equation (5.23). We split all quanti- 
ties into a slowly evolving average part, such as /o, = 0, and Bq, and 

a fluctuating part /i, Ei, and Bi. The long-term averages of the fluctuat- 
ing quantities vanish: (/i) = (Ei) = (Bi) = 0. Note that in contrast to 
the ansatz in Sect. 4.1.2 here the quantities with index ‘0’ are not constant 
background quantities but slowly evolving average properties of the system. 
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These are the quantities we are interested in - the fluctuating quantities are 
of interest only in so far as they give rise to the evolution of the phase space 
density. In Sect. 4.1.2, on the other hand, we were interested in the fluctuating 
quantities because they gave rise to a new phenomenon, the waves. 

With the above ansatz, the average Vlasov equation reads 



dfo 

dt 



+ V 



V/o H V X Bo 

m 



m 

dv 



m 



{El V X B\) 



dv / 



(7.31) 



The term on the right-hand side contains the non-vanishing averages of the 
fluctuations and describes the interactions between the fluctuating flelds and 
the fluctuating part of the particle distribution. These interactions combined 
with the slowly evolving flelds on the left-hand side of (7.31) lead to the phase 
space evolution of the slowly varying part of the distribution. Note that we 
have not made any assumptions about the smallness of the fluctuations, the 
only limitation is a clear separation between the fluctuating part and the 
average behavior of the plasma. 

Equation (7.31) is the fundamental equation in non-linear plasma physics. 
Solutions, however, are difficult to obtain because they require an a priori 
knowledge of the fluctuating flelds to calculate the average term on the right- 
hand side. This term has the nature of a Boltzmann collision term. Note 
that these collisions are not particle-particle interactions but result from the 
non-linear coupling between the particles and the fluctuating wave flelds. 

If the particles and the fluctuating flelds are known, the term on the right- 
hand side can be calculated. It can then be used to derive an expression for the 
scattering coefficients mentioned above which depends on particle properties, 
in general the rigidity, and the properties of the waves, in particular their 
power density spectrum. 



Resonance Scattering. The scattering of particles by waves can be de- 
scribed as a random walk process if the individual interactions lead to small- 
angle scattering only. Thus a reversal of the direction of motion requires a 
large number of these small-angle scatters. If we assume a particle to be in 
resonance with the wave, the scattering is more efficient because the small- 
angle changes all work together into one direction instead of trying to cancel 
each other. Thus pitch angle scattering will mainly occur from interactions 
with held fluctuations with wavelengths in resonance with the particle motion 
along the held. Such a resonance interaction can formally be understood from 
a simple mechanical or electrical analogy, such as a light torsion pendulum in 
a turbulent gas or a resonant circuit excited by noise [560] . A full treatment 
of the theory with application to the scattering of particles in interplane- 
tary space was first given by Jokipii [256] and, with a somewhat different 
approach, by Hasselmann and Wibberenz [215]. 

The idea is sketched in Fig. 7.10. Let us assume a simple model of magnetic 
field fluctuations: the (relevant) waves propagate only along the magnetic field 
ik\\Bo) and the fluctuating quantities are symmetric around the wave vector. 
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Fig. 7.10. Resonance scattering: 
scattering is more efficient if the 
particle interacts only with waves 
in resonance with its motion par- 
allel to the magnetic field 



This assumption is called the slab model. Let us single out a certain wave 
number k. A particle is in resonance with this wave if it propagates a wave 
length A|| along the magnetic field during one gyration: Ay = 

With fc|| == 27 t/A|| and = 2'KjoJc the resonance condition can be written as 



t’ll fiv 



(7.32) 



The amount of scattering a particle with pitch angle a experiences basically 
depends on the power density /(fc||) of the waves at the resonance frequency. 
This dependence clearly shows why the pitch angle diffusion coefficient ti 
depends on //: particles with different pitch angles are scattered by different 
wave numbers with different power densities. 



Alfven Waves and Interplanetary Propagation. Magnetohydrodyna- 
mic waves are Alfven waves with wave vectors parallel and magneto-sonic 
waves with k perpendicular to the undisturbed field. In interplanetary space, 
the resonance scattering of energetic particles at Alfven waves plays an im- 
portant role in particle propagation. The formalism describing this process is 
derived in [215,256] or in the review [156]. 

The basic process is the interaction of the particle with the fluctuating 
component of the magnetic field. Since the fluctuations are assumed to be 
small, Bi the change in pitch angle during a single gyration is small, 

too. Scattering at waves in resonance with the particle motion parallel to 
the held is essential for efficient scattering. The relation between the power 
density of the magnetic field fluctuations and the pitch angle diffusion coeffi- 
cient is relatively simple for the limitation to waves with wave vectors parallel 
to the field and axially symmetric transverse fluctuating components (slab 
model). 

The magnetic field density power spectrum in interplanetary space can 
be described by the power law (see Sect. 6.4, in particular Fig. 6.22) /(fc||) = 
Ck'^^. Here q is the spectral shape, k\\ the wave number parallel to the field, 
and C the power at a certain frequency. The pitch angle diffusion coefficient 
is related to this spectrum by 






(7.33) 
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Fig. 7.11. Shape of the pitch angle scattering 
coefficient for different slopes q of the power den- 
sity spectrum 

with A being a constant related to the level C of the turbulence. The particle 
mean free path can then be determined by (7.28). The mean free path depends 
on particle rigidity as Ay rsj p2-q^ 

as long as q <2. 

Figure 7.11 shows the shape of the pitch angle scattering coefficient for 
different slopes q of the magnetic field power density spectrum. The case 
q = I corresponds to isotropic scattering, i.e. the strength of the scattering is 
independent of the particle’s pitch angle and the diffusion coefficient is given 
as Kiso = A {1 — ij?). With increasing q, a gap develops around /i = 0, the 
width of the gap increasing with q. Thus for particles with pitch angles close 
to 90°, scattering decreases as the power spectrum becomes steeper and it 
becomes increasingly difficult to scatter the particle through 90°, i.e. to turn 
around its motion along the field line. The physical basis is simple: particles 
with large pitch angles are in resonance with waves with small wavelength 
or high wave numbers. Since the power density spectrum decreases towards 
higher wave numbers, there is not enough power left for efficient scattering. 
For q > 2 the gap around = 0 becomes too wide to scatter particles 
through 90°: although pitch angle scattering still occurs, the direction of the 
particle motion along the field is not reversed. Or, in other words: the small- 
angle interactions do not sum up to a large-angle interaction. In this case the 
particle transport into the two hemispheres parallel and anti-parallel to the 
magnetic field is decoupled. 







7.3.5 Electromagnetic Waves 

Pitch angle scattering also occurs in the magnetosphere. Here electrons 
trapped in the radiation belts are scattered into the loss cone by Whistler 
waves. Since Whistlers are electromagnetic waves, both a fluctuating electric 
and a fluctuating magnetic field component exist. Whistlers propagate paral- 
lel to the field, and they are circularly polarized with the electric field rotating 
in the same direction as an electron gyrates (see Sect. 4.6.2). Whistler waves 
therefore have a resonance at the electron cyclotron frequency. 

Particles trapped in the radiation belt perform a bounce motion along the 
field line. Since they are trapped between two magnetic mirrors, they show a 
typical pitch angle distribution: particles with small pitch angles are missing 
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because their mirror point lies deep inside the atmosphere and thus interac- 
tion with the atmosphere is far more likely than reflection. The resulting pitch 
angle distribution is called a loss cone distribution. The population of parti- 
cles with large pitch angles, on the other hand, will bounce back and forth 
between north and south. This distribution would be stable if sufficiently 
large electron fluxes did not excite Whistlers. These Whistlers propagate on 
the background plasma. Like the electrons, Whistlers are trapped in the geo- 
magnetic held, bouncing back and forth between the lower hybrid resonance 
points. These Whistlers lead to pitch angle scattering, scattering the electrons 
into the loss cone, and providing a mechanism for the depletion of the radi- 
ation belts. This scattering is particularly strong if the electron component 
in the radiation belts is large, e.g. during a substorm, because large electron 
fluxes are necessary for efficient wave generation. The electrons scattered into 
the loss cone then precipitate into the ionosphere. 

Under these conditions the quasi-linear diffusion equation is a pure pitch 
angle diffusion equation: 
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The diffusion coefficient written in terms of the fluctuating magnetic field 
then is 
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with o;thresh being the threshold frequency up to which Whistler waves can 
be excited by electrons. 



7,3.6 Transport Equations 



To describe the development of a particle distribution we need a transport 
equation. So far, we have encountered the diffusion equation (7.14) and the 
diffusion-convection equation (7.24). 

Focused Transport. Diffusion due to scattering at magnetic held irregu- 
larities is only one aspect of particle propagation. As we saw in Sect. 6.3, the 
interplanetary magnetic field is divergent. Since the magnetic moment of a 
charged particle is constant, the particle’s pitch angle decreases as it propa- 
gates outward: a particle starting with a pitch angle close to 90° on the Sun 
will have a pitch angle of only 0.7° at the Earth’s orbit. This eflFect is called 
focusing. 

In the inner heliosphere, i.e. within about 1 AU, particle propagation 
is influenced mainly by pitch angle scattering and focusing. The resulting 
transport equation then is the focused transport equation [446]: 
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Here s is the length along the magnetic field spiral as described by (6.14), 
and ( = —B{s)/{dB/ds) is the focusing length. The terms from left to right 
describe the field parallel propagation, focusing in the diverging magnetic 
field, and pitch angle scattering. The term on the right-hand side describes 
the particle source. We have already encountered part of this equation in the 
discussion of pitch angle scattering in Sect. 7.3.2. Since convection is ignored, 
(7.36) should only be applied to particles markedly faster than the solar wind. 
Note that (7.36) cannot be solved analytically but only numerically. 

From solutions of the transport equation we can determine the phase 
space density as a function of time, location and pitch angle. Thus, for a 
fixed location not only can we determine the intensity-time profile but also 
the temporal evolution of the pitch angle distribution. The latter can be de- 
scribed as an anisotropy: if the anisotropy vanishes, particles are streaming 
isotropically from all directions, while for a large anisotropy particles pre- 
dominately come from one direction. 



Focused Transport Including Solar Wind Effects. For low-energy par- 
ticles in the inner heliosphere, both focusing and convection with the solar 
wind are strong systematic effects and should be considered in the transport 
equation. Here an extension of (7.36) has been suggested which also includes 
convection with the solar wind and adiabatic deceleration [448]. An already 
simplified form of the equation reads 
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The distribution function F(t, s, /x',p') depends on time t, distance s along 
the field line, pitch angle p' and momentum p' . The primes indicate that the 
latter two quantities are measured in the solar wind frame. Note that F is not 
the phase space density but a distribution function. In particular, in contrast 
to phase space density, it depends on momentum but not energy. 

Equation (7.37) considers, in addition to the terms already mentioned in 
connection with (7.36), the convection with the solar wind (the additional 
term in the first set of parentheses) and adiabatic deceleration (the d/dp' 
term). Adiabatic deceleration is related to solar wind expansion: as the solar 
wind expands, the distance between the scattering centers frozen into the 
solar wind increases. Thus, the “cosmic ray gas” expands too, and, therefore, 
cools. Adiabatic deceleration differs from the other transport processes, in- 
sofar as it changes particle momentum. This makes numerical solutions to 
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the transport equation even more complex since momentum is added as an 
additional dimension [269,448]. Thus, for application to observational data, 
in general the focused transport equation (7.36) or the diffusion-convection 
equation (7.24) are used. 

Figure 7.12 demonstrates the influence of the solar wind effects and their 
variation with energy. Solid lines are solutions of (7.37), dashed ones are so- 
lutions of (7.36), both for different energies between 50 keV and 340 MeV, a 
radial mean free path of 0.1 AU, and an observer at 1 AU. The considera- 
tion of solar wind effects leads to an earlier onset in particle intensity (and, 
therefore, also in anisotropy), an earlier maximum and a faster decrease in 
intensity. The earlier onset and maximum are mainly due to convection with 
the solar wind. Therefore, their effect is most pronounced in low particle en- 
ergies where the average particle speed is comparable to the solar wind speed. 
With increasing solar wind speed, convection with the solar wind becomes 
less and less important: at energies above a few MeV onset and maximum 
are the same, independent of whether solar wind effects are considered or 
not. The faster decrease in intensity (and anisotropy) is due to adiabatic de- 
celeration: for a certain energy interval more particles are removed to lower 
energies than are added from higher ones, owing to the shape of the energy 
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Fig. 7.12. Solution of the focused transport equation with {solid) and without 
{dashed) consideration of solar wind effects 
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spectrum. At energies below a few MeV, this effect is enhanced by convection 
with the solar wind. 



7.4 Interplanetary Propagation - Observations 

To infer interplanetary propagation conditions from observations, we can 
choose from two methods: (a) we can fit a suitable transport equation on 
intensity- and anisotropy-time profiles; or (b) we analyze the interplanetary 
magnetic field turbulence. In both cases, we get a pitch angle diffusion coef- 
ficient k(//) or a mean free path A. 

7.4.1 Fits with a Transport Equation 

As an example. Fig. 7.13 shows the intensity-time and anisotropy-time pro- 
files of 0.5 MeV electrons. In the upper panel, the intensity time profile shows 
the fast rise and slow decay typical of a diffusive profile (see Fig. 7.9). The 
anisotropy (lower panel) is high early in the event because the first parti- 
cles always come from the solar direction. It decays as particles are scattered 
back towards the Sun. The solid lines give a fit of (7.36) on the intensity and 
anisotropy profile with a mean free path A of 0.05 AU. 

Particle mean free paths are different from event to event and also depend 
on particle rigidity. Figure 7.14 summarizes the results from fits on a large 




1980 MAY 3 0 3 - 0 8 MeV ELECTRONS T/H 

Fig. 7.13. Typical diffusive profile of an electron event in interplanetary space. The 
solid line gives a fit with the focused transport model (7.36); the second increase 
starting at about 14 UT is due to a second flare. Reprinted from M.-B. Kallenrode 
et al. [276], Astrophys. J. 391, Copyright 1992, American Astronomical Society 
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sample of particle events with the mean free path plotted versus magnetic 
rigidity. Filled symbols refer to electron observations, open ones to protons. 
The shaded area gives the Palmer consensus range [388], where most of the 
events cluster. Here A is between 0.08 AU and 0.3 AU and, at least in this 
statistical study, is independent of rigidity. In individual events, however, a 
small increase in A with increasing rigidity can be observed. 

Over the course of time, a large number of particle events has been ana- 
lyzed. The main results can be summarized as follows: 

• Particle motion predominately is parallel to the field, diffusion perpen- 
dicular to the magnetic field can be neglected in the inner heliosphere 
[158,256,300,432]. 

• Particle profiles observed in interplanetary space are determined by both 
the time-development of the particle injection from the Sun and the sub- 
sequent interplanetary scattering. If only the intensity is considered, differ- 
ent combinations of injection and diffusion could produce the same profile. 
This problem can be circumvented by fitting simultaneously intensity- and 
anisotropy-time profiles [346,469]. 

• Particle propagation can range from almost scatter-free (with mean free 
paths of the order of 1 AU) to strongly diffusive (with A about 0.01 AU); for 
most events A|| is between 0.08 and 0.3 AU (Palmer consensus range [388]). 

7.4.2 Analysis of Magnetic Field Fluctuations 

The dotted line in Fig. 7.14 shows the expected rigidity dependence of the 
particle mean free path determined in an entirely different approach. Since 
the elementary mechanism is pitch angle scattering by resonant wave-particle 
interactions, the magnetic field turbulence determines the amount of inter- 
planetary scattering. Thus, an analysis of the magnetic field fluctuations also 
gives a particle mean free path (see Sect. 7.3.2). The fluctuations are as- 
sumed to be small {SB ^ B), and so quasi-linear theory can be applied. 
In addition, the wave vectors k of the fluctuating field are assumed to be 
parallel to the average magnetic field Bq (slab model). This approach is also 
called standard quasi-linear theory or standard QLT. As can be seen from 
Fig. 7.14, the mean free path determined with standard QLT is always below 
that obtained by fitting. In addition, standard QLT predicts an increase of A 
with rigidity while the observations suggest A to be independent of rigidity. 
This discrepancy has been known since the early 1970s [216], and it has been 
called the discrepancy problem. However, as in case of the fits, magnetic field 
fluctuations and the mean free paths derived from them turned out to be 
highly variable from event to event (or time span to time span) [547]. 

7.4.3 Comparison Between Both Approaches 

More hints on the discrepancy problem can be gained from the comparison 
between mean free paths derived from fits and magnetic field fluctuations on 
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Fig. 7.14. Particle mean free paths in different solar energetic particle events 
plotted versus the particle’s magnetic rigidity. Reprinted from J.W. Bieber et al. 
[46], Astrophys. J. 420, Copyright 1994, American Astronomical Society 



an event to event basis. Figure 7.15 shows the results of such a comparison: (a) 
the mean free paths determined from magnetic field fluctuations always (with 
one exception) are smaller than those derived from fits: Aqlt < Afit; and (b) 
the discrepancy XQur/^nt is highly variable from event to event. In addition, 
the observed rigidity dependence of the mean free path is different from that 
predicted from quasi-linear theory: for 1 MV electrons and 187 MV protons 
the ratio between the mean free paths obtained from fits is Ap/Ag = 1.6 ±0.9 
while quasi-linear theory predicts a ratio of 6 [265]. 

The main reason for this discrepancy problem appears to be the inter- 
pretation of the magnetic field fluctuations. If their scattering power is over- 
rated, the particle mean free path obtained by standard QLT would be too 
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Fig. 7.15. Comparison of mean free 
paths derived from fits (vertical axis) 
and from magnetic field fluctuations 
using standard QLT (horizontal axis) 
on an event to event basis [548]. 
Reprinted from Wanner et al. [548], 
Adv. Space Res. 13(9), Copyright 
1993, Pergamon Press 
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Fig. 7.16. Magnetic 
field fluctuations as 
2D dynamical turbu- 
lence. The reduced 
amount of slab tur- 
bulence lets the mean 
free paths approach the 
Palmer consensus range 
(shaded). Reprinted 
from J.W. Bieber et 
al. [46], Astrophys. J. 
420, Copyright 1994, 
American Astronomical 
Society 



small. Misinterpretation might be easy: observations with one spacecraft give 
a scalar power density spectrum of magnetic field fluctuations as these are 
carried across the observer. However, different three-dimensional fluctuations 
can lead to the same scalar spectrum. In standard QLT it is assumed that 
all fluctuations arise from waves with fe||^o- Thus, all power contained in 
the fluctuations also scatters the particles. To reduce the discrepancy, we can 
interpret the power-density spectrum so as to remove power from the waves 
parallel to the magnetic field. For instance, if we assume the fluctuations to 
be Alfven waves propagating radially away from the Sun, the fleld-parallel 
power available for particle scattering would be reduced [254]. This effect 
would be particularly strong at larger distances where the angle between 
the interplanetary magnetic field line and the radial direction is large. Since 
some observations suggest an increase of the discrepancy with increasing 
distance [549], this ansatz is attractive, although it is problematic because 
Alfven waves with large angles with respect to the field are easily dampened. 

A different approach interprets the magnetic field fluctuations not as 
waves but as two-dimensional dynamical turbulence [46]. Observations of 
the field fluctuations suggest that the correlation function is strongest in the 
directions perpendicular and parallel to the field [337] . Thus, the fluctuations 
seem to consist of two components: slab-like Alfven waves parallel to the 
magnetic field, and a two-dimensional turbulent component perpendicular to 
the field. The relative amount of the slab-component is between 12% and 
20%, and the two-dimensional turbulence contributes between 80% and 88% 
to the power-density spectrum. Since only the slab-component scatters the 
particles, the mean free paths determined with this modified QLT are much 
closer to the observed ones (see Fig. 7.16). In addition, for low rigidities, 
electron and proton mean free paths decouple: thus, the observed rigidity 
(in) dependence can be reproduced with this model, too. 

Nonetheless, even with this ansatz it might be too early to declare the 
discrepancy problem solved and interplanetary transport understood. In our 
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description of particle propagation we always assume that scattering occurs 
by fluctuations uniformly distributed in space and time. There are no isolated 
scattering centers. On the other hand, we know that the level of turbulence 
in interplanetary space can be quite different. We have already seen from 
Fig. 6.21 that quiet and turbulent periods exist. On shorter time scales of 
about an hour the fluctuation level can also be highly variable. Again, with 
one spacecraft it is difficult to interpret this: is the level of fluctuations uni- 
form along a magnetic field line but different on neighboring ones and do the 
changes in fluctuation level result from the motion of the observer relative to 
the field lines, or do the magnetic field fluctuations show a completely irregu- 
lar pattern with variations along the field line as well as between neighboring 
field lines? The latter case would require an entirely new approach to our un- 
derstanding of propagation in interplanetary space, in particular because we 
would have to consider a stochastic distribution of scattering centers instead 
of the continuous wave fields assumed today. 



7.5 Particle Acceleration at Shocks - Theory 

Particle acceleration at collisionless shocks can be observed best at planetary 
bow shocks and travelling interplanetary shocks. Compared with shock waves 
in other astrophysical objects, such as supernovae remnants or quasars, in 
these shocks both plasmas and energetic particles can be observed in situ. In 
this section the fundamentals of shock acceleration will be reviewed. More 
detailed reviews can be found in [259,412,506,519,524]. 

There are different physical mechanisms involved in the particle acceler- 
ation at interplanetary shocks: 

• the shock drift acceleration (SDA), sometimes also called scatter- free ac- 
celeration, in the electric induction field in the shock front; 

• the diffusive shock acceleration due to repeated reflections in the plasmas 
converging at the shock front; and 

• the stochastic acceleration in the turbulence behind the shock front. 

The relative contributions of these mechanisms depend on the properties of 
the shock. For instance, shock drift acceleration is important in perpendicular 
shocks (0Bn = 90°) where the electric induction field is maximal, but vanishes 
in parallel shocks. Stochastic acceleration requires a strong enhancement in 
downstream turbulence to become effective, while diffusive acceleration re- 
quires a sufficient amount of scattering in both upstream and downstream 
media. In addition, shock parameters such as speed, compression ratio (ratio 
of the densities in the upstream and downstream media), or Mach number, 
determine the efficiency of the acceleration mechanism. 

If we discuss particle acceleration at shocks, we treat the particles as test 
particles, which do not affect the shock. The shock is thin compared with 
the Larmor radius, and thus the adiabatic invariants still hold. The jump 
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in field and plasma may be arbitrarily large but both are homogeneous in 
both upstream and downstream media, and the shock front is planar. Thus 
effects due to the curvature, in particular drifts, are neglected. Since the 
total extension of a shock is large compared with the Larmor radius, this 
approximation is reasonable for a start. 

7.5.1 Shock Drift Acceleration (SDA) 

Shock drift acceleration (SDA) takes advantage of the electric induction field 
in the shock front [11, 122, 134, 165]. To allow for a reasonably long drift path, 
scattering is assumed to be negligible. SDA therefore is also called scatter-free 
shock acceleration. The calculations by Schatzmann [458] were the first the- 
oretical studies containing an order-of-magnitude estimate of the efficiency 
of this mechanism. They showed that a very efficient acceleration on short 
temporal and spatial scales is only possible if there is additional scattering 
which feeds the particles back into the shock for further acceleration. As we 
shall see below, this problem of feeding the particles back into the acceler- 
ation mechanism is a recurrent problem in the study of shock acceleration, 
independent of the actual acceleration mechanism. 

In shock drift acceleration, a charged particle drifts in the electric induc- 
tion field in the shock front. In the shock’s rest frame, this is 

E = -Uu X Bu = -Ud X Bd . (7.38) 

This field is directed along the shock front and perpendicular to both mag- 
netic field and bulk flow; it is maximal at a perpendicular shock and vanishes 
at a parallel shock. In addition, the shock is a discontinuity in magnetic field 
strength. Thus a particle can drift along the shock front according to (2.54). 
The direction of the drift depends on the charge of the particle and is always 
such that the particle gains energy. 

Figure 7.17 shows sample trajectories for particles in the rest frame of a 
quasi-perpendicular shock with ^Bn = 80°. The abscissa shows the distance 
from the shock in gyro-radii; the ordinate gives the particle energy in units 
of the initial energy Eq. The dashed lines indicate the shock; the upstream 
medium is to the left, the downstream medium to the right. The electric 
induction field is parallel to the shock front in the upward direction. In the left 
panel, the motion of the particle starts in the upstream medium. The particle 
then drifts along the shock front, gaining energy. This energy gain changes 
the details of the drift path: the velocity component perpendicular to the 
shock increases, eventually becoming larger than the shock speed. Then the 
particle separates itself from the shock front. In the left panel, the particle is 
refiected back into the upstream medium. The other two panels show particles 
transmitted through the shock, either from the upstream medium to the 
downstream medium (middle panel) or vice versa (right panel). 

The details of the particle trajectory, in particular the question of whether 
the particle is transmitted or refiected, strongly depends on its initial energy 
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Fig. 7.17. Shock drift acceleration: sample trajectories in the shock rest frame of a 
quasi-perpendicular shock with ^Bn = 80°. Reprinted from R.B. Decker [123], Space 
Science Reviews 48, Copyright 1988, with kind permission from Kluwer Academic 
Publishers 



and pitch angle (for additional examples of trajectories see, for example, 
[11, 123]), leading to a characteristic angular distribution of the accelerated 
particles: an initially isotropic distribution of particles in the upstream and 
downstream media is converted to a very strong field-parallel beam in the 
upstream medium and to a smaller beam roughly perpendicular to the field 
in the downstream medium. 

The energy gain of a particle is largest if the particle can interact with 
the shock front for a long time. This time depends on the particle’s speed 
perpendicular to the shock. If this is small, the particle sticks to the shock, if 
it is large, the particle escapes before it has gained a large amount of energy. 
This particle speed relative to the shock is determined by the particle speed, 
the shock speed, the pitch angle, and ^Bn- With increasing particle speed the 
range of the other three parameters, which allow for efficient acceleration, 
narrows, reducing the chance of efficient acceleration. Thus, acceleration to 
higher and higher energies becomes increasingly difficult. 

Since the magnetic moment is conserved, the momentum perpendicular 
to both shock and field after an interaction between particle and shock is 

m ^ ^ 

Pl± 



(7.39) 
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The normal component of the momentum is unchanged by the interaction. 
Thus the change in momentum is determined by the magnetic compression 
re, the ratio between the upstream and downstream magnetic field strengths. 
Equation (7.39) is valid only for perpendicular shocks. For oblique shocks, 
Toptygin [519] gives the approximation AE ^ pu^/Obu- 

The average energy gain is a factor 1.5-5. Evidently, such an energy gain 
is much too small to accelerate particles out of the solar wind to energies of 
some tens of kiloelectronvolts or even some tens of megaelectronvolts. Acceler- 
ation up to higher energies by shock drift acceleration would require repeated 
interactions between particles and shock. If a particle is in the downstream 
medium, further acceleration is not difficult because the turbulence created 
by the shock leads to stochastic acceleration or scatters the particle back 
towards the shock front. But once a particle has escaped into the upstream 
medium, the shock will not be able to catch up with it again, and thus no 
further acceleration is possible. This statement is only true for scatter-free 
conditions as originally assumed in shock drift acceleration. Space plasmas, 
however, are turbulent plasmas, and particles are scattered depending on the 
level of turbulence. This allows for repeated interactions between particles 
and shock and thus a higher energy gain [124-126]. Note that in the presence 
of scattering the energy gain in each individual interaction between particle 
and shock in general is smaller because the particle is easily scattered off 
from its drift path along the shock front. The larger net effect results from 
the larger number of interactions. 

Shock drift acceleration out of the solar wind up to energies of a few 
kiloelectronvolts to some tens of kiloelectronvolts can be observed at the 
Earth’s bow shock and at interplanetary travelling shocks (see below). 

7.5.2 Diffusive Shock Acceleration 

Diffusive shock acceleration is the dominant mechanism at quasi-parallel 
shocks. Here the electric induction field in the shock front is small and shock 
drift acceleration is negligible. In diffusive shock acceleration, the particle 
scattering on both sides of the shock is crucial. Since the scattering centers 
are frozen into the plasma, particle scattering back and forth across the shock 
can be understood as repeated reflection between converging scattering cen- 
ters. The concept was developed in the 1970s [16,39,51,137]; for reviews, see, 
for example, [13,165,315,461,462,538]. 

Figure 7.18 shows the motion of a particle in the rest frame of a quasi- 
parallel shock. The magnetic fields on both sides of the shock are turbulent; 
the resulting pitch angle scattering is quantified by the diffusion coefficients 

and or the mean free paths Au and Ad- Assume that the particle 
has just traversed the shock front in the upstream direction. The particle 
then follows a zigzag path and eventually is scattered back to the shock 
front, traversing it into the downstream medium. Here the same process is 
repeated, only with the diffusion coefficient characteristic of the downstream 
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Fig. 7.18. Diffusive shock acceleration; mo- 
tion of a particle in the shock rest frame. 
One cycle of motion consists of a crossing 
from one side of the shock to the other and 
back again 



medium. As the particle is finally scattered back towards the shock front, 
traversing it into the upstream medium, a new cycle of motion begins. 

But where in this cycle does the acceleration occur? Whereas in shock 
drift acceleration the location of acceleration is well defined, in diffusive shock 
acceleration the acceleration is given by the sum of all pitch angle scatters 
within the cycle. The energy gain can be determined from a simplification. 
Let us reduce the cycle to two isolated collisions, one in the upstream medium 
and the other in the downstream medium: in the upstream medium the par- 
ticle gains energy due to a head-on collision with a scattering center; in the 
downstream medium it loses energy because the scattering center moves in 
the same direction as the particle. Since the flow speed, and therefore the 
speed of the scattering center, is larger upstream than downstream, a net 
gain in particle energy results. The amount of energy gained in each interac- 
tion depends on the relative speed between particle and scattering centers. 
The simplified picture sketched here assumes particle motion parallel to the 
field. In reality, the particle will have a pitch angle. Then the energy gain 
depends on the particle velocity parallel to the magnetic field. Thus for a 
given particle speed the energy gain depends on pitch angle, too. 

Diffusive shock acceleration is based on scattering, which is a stochastic 
process. Therefore we cannot calculate the path of a certain particle. How- 
ever, from the initial particle distribution, the shock parameters, and the 
scattering conditions in the upstream and downstream media we can cal- 
culate the average behavior of a large number of particles. Diffusive shock 
acceleration, therefore, must be described in terms of a phase space density 
and a transport equation. The latter can be written as 



dt 
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(7.40) 



Here / is the phase space density, U the plasma speed, D the diffusion tensor, 
and T the loss time. The terms, from left to right, present: convection of 
particles with the plasma flow, spatial diffusion, diffusion in momentum space 
(acceleration), losses due to particle escape from the acceleration region, and 
convection in momentum space due to processes which affect all particles, 
such as ionization or Coulomb losses. The term on the right-hand side is a 
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source term describing the injection of particles into the acceleration process. 
In a first-order approximation losses and convection in momentum can be 
ignored, reducing the transport equation to 

^ + nv/ - V (D V/) - - Q{p, r, t) . (7.41) 

Here the terms containing the plasma speed U and the diffusion tensor D 
describe the acceleration of the particles across the shock front. 

Often we are not interested in the evolution of the particle distribution 
at the shock but only want to know the particle spectrum and the acceler- 
ation time in steady state. In this case, df/dt equals zero and the trans- 
port equation can be solved for suitable boundary conditions, such as steadi- 
ness in particle density and in the normal component of the particle flow 
S = —47rp^[Up{df /dp) + D • V/] across the shock front. This steady-state 
equation leads to predictions of the acceleration time, the particle energy 
spectrum, and the intensity increase upstream of the shock. 

Characteristic Acceleration Time. Prom the transport equation (7.41) 
the time required to accelerate particles from momentum po to momentum p 
can be inferred to be 



t = 




(7.42) 



For all momenta below p the particle distribution at the shock is in steady 
state. Note that here isotropic scattering in a homogeneous medium is as- 
sumed, reducing the diffusion tensor D to a diffusion coefficient D. 

If we assume the diffusion coefficient D to be independent of momentum 
p, then (7.42) can be integrated to give a characteristic acceleration time 

na 

Ud 

in p{t) = pQexp{t/To). Equation (7.43) is often written as 

- 

r - 1 ’ 

with r = Uu/ud being the ratio of the flow speeds in the shock rest frame. For 
a parallel shock, this ratio equals the compression ratio. In (7.44) Dd/ud is 
assumed to be small compared with D^/uu because of enhanced downstream 
turbulence. 



(7.44) 
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( 7 . 43 ) 



Example 23. Let us now assume a shock with an upstream speed of 800 km/s 
in the shock rest frame and a ratio of flow speeds r = 3. With a typical 
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upstream mean free path Au = 0.1 AU for 10 MeV protons, we obtain an up- 
stream diffusion coefficient Z>u = v\jl — 3.26 x 10^^ m^/s. With (7.44), we 
then obtain a characteristic acceleration time r = 2.3 x 10^ s, that is, almost 
27 days or one solar rotation. Even if we were to increase the scattering by 
a factor of 10, the characteristic acceleration time would be almost 3 days, 
which is longer than the average travel time of a shock between the Sun and 
the Earth. However, under these conditions, the mean free path would be 
rather small (see Fig. 7.15) and the acceleration would still be inefficient, be- 
cause during the characteristic acceleration time the momentum is increased 
only by a factor e. The situation is different at a lower particle speed, say 
100 keV. In that case, the diffusion coefficient is an order of magnitude smaller 
and, consequently, the characteristic acceleration time is only a tenth of that 
for 10 MeV protons, that is, roughly 7 hours (for the optimistic case of strong 
scattering with A = 0.01 AU). During a travel time of 2 days, the particle 
momentum therefore increases by about three orders of magnitude (under 
the simplifying assumption of a diffusion coefficient independent of particle 
energy /momentum) . □ 

The Energy Spectrum. The energy spectrum expected from diffusive 
shock acceleration is a power law: 



J(E)- JoE-^ 



(7.45) 



The spectral index 7 depends only on r. In the non-relativistic case this is 



7 = X 



lr + 2 



2r-l 



(7.46) 



In the relativistic case, the spectral index becomes 7rei = 27. Again, this 
holds only for the steady state, i.e. 

Why do we get an energy spectrum and not, for instance, a monoenergetic 
distribution? The energy gain for each particle is determined by its pitch angle 
and the number of shock crossings. The latter is determined by the stochastic 
process of scattering. For a high energy gain, the particle must be “lucky” to 
be scattered back towards the shock again and again. Most particles make 
a few shock crossings and then escape into the upstream medium. They are 
still scattered there, but they have escaped too far to be scattered back to 
the shock. Thus the stochastic nature of diffusion allows high gains for a few 
particles, while most particles make only small gains. 



Example 24- Typical values for the shock compression ratio r are between 
slightly above 1 and about 8; the average value is close to 2. With this value 
from (7.46) we obtain a spectral index 7 = 2, which is rather flat compared 
with the observed spectral indices, which cluster around 3. □ 
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Intensity Increase Upstream of the Shock. With an infinitely thin 
shock at X = 0 and a continuous particle injection Q = Po)/(47rpo) 

at the shock front, the spatial intensity variation around the shock is 

f(x,p) = f{0,p)exp{-(3i\x\} , (7.47) 

with Pi = Ui/Di, the index i indicating the upstream and the downstream 
medium, respectively. If particle escape is considered (the f/T term in (7.40)), 
the scale length Pi for the upstream intensity increase is 




Figure 7.19 shows a typical upstream intensity increase and its depen- 
dence on particle energy. If P is spatially constant, the intensity increase is 
exponential, with its slope depending on D. Because A increases with en- 
ergy (Sect. 7.4), the ramp is steeper for lower energies E\. In addition, the 
intensity at the shock is higher for lower energies, refiecting the power-law 
spectrum (7.45). In interplanetary space, the slope of the intensity increase 
is often used to determine the scattering conditions upstream of the shock. 

Example 25. With the parameters given in example 23, the scale length of 
the upstream increase is 2.5 x 10“^^ m“^, or 0.35 AU~^. Let us assume that 
the shock has reached steady state. In this case this upstream increase would 
be convected across the observer at the shock speed. If this is 1200 km/s, a 
scale length of 0.35 AU takes about 54 000 s, or 15 h, to pass the observer. 
Thus, in the intensity time profile upstream of the shock, we would expect an 
increase by a factor of e over a time of 15 h. Again the situation is different 
for 100 keV protons. Here the diffusion coefficient and thus the scale length 
are an order of magnitude smaller and, consequently, the rise in intensity 
upstream of the shock will be much steeper: a factor of e in 1.5 h. □ 
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7.5.3 Diffusive Shock Acceleration 
and Self- Generated Turbulence 

Occasionally, (7.44) is used to show that proton acceleration up to mega- 
electronvolt energies at travelling interplanetary shocks is difficult: “Yet the 
Fermi process develops so slowly that the protons accelerated by quasi- 
parallel interplanetary shocks only reach energies of a few hundred thousand 
electron volts in the one day it takes the shock to travel from the Sun to the 
Earth” [455] . Note that this conclusion depends strongly on the assumptions 
about the particle mean free path: the particle mean free paths inferred from 
solar energetic particle events, lead to acceleration times too high to explain 
the observations. For higher energies (e.g. in the megaelectronvolt or tens 
of megaelectronvolt range), the mean free path inferred from solar energetic 
particle events is so large that a particle would interact with the shock only a 
few times during its travel time from the Sun to the Earth. This even violates 
the assumption of frequent interactions inherent in the transport equation. 

However, if the turbulence upstream of the shock could be increased, the 
acceleration would be more efficient. But how can we increase the amount of 
upstream scattering? As we saw in Chap. 7.3, scattering is a consequence of 
wave-particle interactions. But waves cannot propagate away from the shock 
into the upstream medium. Thus the upstream medium does not have any 
knowledge of the approaching shock (which, of course, is inherent in the def- 
inition of a shock). This statement is true from the viewpoint of the plasma. 
But as we saw in the previous section, the energetic particles do escape from 
the shock front, as evident in the upstream intensity increase. Whenever ener- 
getic particles stream faster than the Alfven speed, they generate and amplify 
MHD waves with wavelengths in resonance with the field parallel motion of 
the particles [29], i.e. the same resonance condition as in Fig. 7.10. These 
waves grow in response to the intensity gradient of the energetic particles. 
A hydrodynamic approximation on the wave growth, treating the energetic 
particles as a fluid, can be found in [349]. Lee has introduced a model in 
which the diffusion equations describing the particle transport and the wave 
kinetic equations describing the wave transport are solved self-consistently for 
the Earth’s bow shock [312] and travelling interplanetary shocks [313]. Lee’s 
model allows some predictions regarding the particles and waves upstream of 
the shock: (i) At or just upstream of the shock the particle spectrum can be 
described by a power law in momentum / ~ with a = {1 — U 2 /ui)/?t, 
The scale-length of the intensity increase just upstream of the shock increases 
with increasing particle momentum as (ii) The upstream waves gener- 
ated by the particles propagate into the upstream direction. The growth rate 
is largest for waves with wave vectors parallel to the ambient magnetic field. 
The scale- length for the decay of the upstream waves is about the scale-length 
for the intensity increase of particles in resonance with these waves. Thus the 
scale-length of wave decay depends on the wave number. The power density 
spectrum of the excited upstream waves can be described as /(fc) ~ k^~^. 
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(iii) The ratio between the particle energy density Sp and the magnetic field 
density of the fluctuating waves {\6B\‘^)/2fio is proportional to the ratio be- 
tween upstream flow speed and Alfven speed: 



(|5B|2)/2/xo 



OC 






(7.49) 



These predictions have been found to be in fair agreement with the obser- 
vations in the 11 November 1978 event [283,315], although some differences 
in the details between the observed and the predicted turbulence have been 
found. The energy up to which the accelerated particle population is in steady 
state is a few hundred kiloelectronvolts. 

In sum, Lee’s model suggests acceleration at the shock to occur as follows. 
First, accelerated particles stream away from the shock. As they propagate 
upstream, the particles amplify low-frequency MHD waves in resonance with 
them. Particles escaping from the shock at a later time are scattered by 
these waves and are partly reflected back towards the shock. These latter 
particles again interact with the shock, gaining additional energy. Thus even 
more energetic particles escape from the shock, amplifying waves in resonance 
with these higher energies. The net effect is an equilibrium between particles 
and waves which in time shifts to higher energies and larger wavelengths. 



7.5.4 Stochastic Acceleration 

In the previous section we saw that turbulence generated (or amplified) by 
particles streaming into the upstream medium is important for efficient shock 
acceleration. In the downstream medium, the situation is different. As the 
shock passes by, waves are generated with wave vectors in both the upstream 
and downstream directions. Thus a particle can either gain energy from the 
wave or supply energy to it. This is a stochastic process, and the resulting 
particle acceleration is called stochastic acceleration or second-order Fermi 
acceleration. Fermi originally had suggested particle scattering at uncorre- 
lated magnetic inhomogeneities moving in arbitrary directions. A particle 
then gains or loses energy, depending on whether it hits the inhomogeneity 
head-on or not. Because of the Doppler effect, head-on collisions are slightly 
more frequent, leading to a net gain in energy. The first considerations of 
the possibility of second-order Fermi acceleration in the turbulence in the 
downstream medium suggested that this effect is of minor importance [367] 
compared with the first-order Fermi acceleration described above. Thus ap- 
plications of stochastic acceleration in general are limited to solar flares or 
astrophysical objects. At travelling interplanetary shocks stochastic accelera- 
tion occasionally seems to lead to intensity increases in the upstream medium 
which become evident as jumps in the intensity as the shock passes over the 
observer (Sect. 7.6). 

Physically, stochastic acceleration is based on wave-particle interactions. 
In contrast to spatial scattering here the interaction leads to scattering in 
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Fig. 7.20. Resonant wave-particle interaction: 
the particle interacts with the fluctuating electric 
field of a circularly polarized wave. The particle is 
in resonance with the wave if the wave frequency 
equals the gyration frequency. Depending on the 
phase between wave and particle gyration, the par- 
ticle is either constantly accelerated or decelerated 



momentum space. In pitch angle scattering the particle interacts with the 
fluctuating magnetic field of the waves. A different kind of interaction can 
take place if the particle interacts with the fluctuating electric field of a wave, 
in particular if this field rotates around the magnetic field line as in a circu- 
larly polarized wave. A particle is in resonance with the wave if its gyration 
frequency equals the frequency of the wave. In that case, the particle is either 
accelerated or decelerated continuously. Figure 7.20 illustrates this principle. 
In the upper panel the gyration of a particle around Bq is shown together 
with the fluctuating electric field. To make the figure readable, the fluctuat- 
ing magnetic field is not shown. In the lower panel, the two extreme cases 
are illustrated: depending on the phase between the wave and gyro-orbit, 
the particle either moves parallel or anti-parallel to the electric field. Thus, 
either deceleration or acceleration results. In the latter case, the wave energy 
is converted into particle energy, and in the former case the particle energy 
is converted into wave energy. The resulting acceleration is called stochastic 
acceleration because the result (acceleration or deceleration) depends on the 
random phase between wave and particle. 

Since the acceleration or deceleration changes the particle speed perpen- 
dicular to the average magnetic field, the particle’s pitch angle changes too. 
Stochastic acceleration can be described as diffusion in momentum space as 
long as the energy gain is small compared with the particle’s energy (see 
Sect. 7.3.3). Then the accompanying change in pitch angle is small, too. 



7.5.5 The Shock as a Non-linear System 

Shocks are highly non-linear systems with complex interactions between plas- 
mas, waves, and energetic particles. We often use linear or quasi-linear ap- 
proximations to describe some aspects of the entire phenomenon, such as 
shock formation or particle acceleration, but we have to be aware that these 
are approximations. For instance, particle acceleration and wave generation 
require energy which, of course, has to be taken from the plasma flow, thus 
altering the shock. 

Figure 7.21 sketches the complexity of the phenomenon shock. The sys- 
tem can be divided into three parts: the plasma, the wave field, and the 
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Fig. 7.21. Sketch of the non- 
linear interactions at a shock 
front. Arrows indicate the di- 
rection of energy flows, based 
on [13] 



energetic particles (all shown as rectangles). The plasma is characterized by 
its distribution function / which also serves as the injection function into 
the acceleration process, and by its hydromagnetic parameters pressure p, 
density p, bulk speed u, and magnetic field B. The energetic particles are 
characterized by their density U and their streaming S. The wave field can 
be described by the power spectrum P{k) or the average squared magnetic 
fiuctuations ((J5)^). The wave field determines the diflFusion coefficient and 
therefore the coupling between background plasma and energetic particles. 

In a linear system, we could determine the relevant quantities for the 
waves and fields independently and use the resulting diffusion coefficient and 
velocity field to calculate the acceleration of energetic particles. In a non- 
linear system, we also have to consider mutual interactions. This can be 
illustrated by any pair of shock components in Fig. 7.21. Let us start with 
the wave field and the background plasma: waves can be amplified by the 
interaction with the shock, they can be dampened by the plasma, feeding 
wave energy into the plasma, or the wave pressure and energy flux can change 
the dynamics of the plasma. The non-linear interaction between another pair 
of shock constituents, the wave field and the energetic particles, has been 
partially discussed before: energetic particles can excite or amplify waves by 
streaming instabilities while these waves in turn scatter the particles, leading 
either to second-order Fermi acceleration in the downstream medium or more 
efficient first-order Fermi acceleration. Part of the particle’s energy therefore 
can be transferred to the plasma by first converting it to wave energy which 
is then converted into plasma energy by damping. 

A shock therefore is a complex, highly non-linear system. Even if all rel- 
evant equations can be written down, it is not possible to solve them self- 
consistently. Therefore, either approximations can be used, such as the ones 
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presented in this chapter, or the system or part of the system can be studied 
by means of Monte-Carlo simulations. 

Another crucial aspect of shock acceleration not addressed so far is the 
injection problem: to be accelerated, the particle must be sufficiently ener- 
getic to be scattered across all the micro- and macro-structures of the shock 
to experience the compression between the upstream and the downstream 
medium. Thus although we assume particle acceleration out of the solar wind, 
not all solar wind particles can be accelerated because they are not energetic 
enough. Different scenarios can be developed ranging from the acceleration 
of particles out of the high energetic tail of the distribution only to some 
kind of pre-acceleration by direct electric fields or the requirement of a pre- 
accelerated particle component. A review of this injection problem is given 
in [568]. 

7.5.6 Summary Shock Acceleration 

Shocks provide an important acceleration mechanism for particles of plane- 
tary, solar and even galactic origin. Three acceleration mechanisms can be 
distinguished which are not mutually exclusive but work together with dif- 
ferent relative contributions depending on the properties of the shock: 

• In shock drift acceleration (SDA, also called scatter-free acceleration) par- 
ticles gain energy by drifting along the electric induction field in the shock 
front. The energy gain is between a factor of 1.5 and 5, and the particle 
interacts with the shock only once. If particles are scattered in the ambient 
plasma, the energy gain per interaction is reduced because the drift path is 
shortened, but particles can interact repeatedly with the shock. This might 
lead to a higher total energy gain. 

• In diffusive shock acceleration the particles gain energy due to repeated 
scattering in the plasmas converging at the shock front (first-order Fermi 
acceleration). The time scales of acceleration and the energies acquired 
depend on the amount of scattering. Diffusive shock acceleration is a slow 
process, the maximum energy acquired depends on scattering conditions, 
shock parameters, and the time available for acceleration. 

• Diffusive shock acceleration becomes more efficient if the turbulence cre- 
ated by the particles is considered. This self-generated turbulence in the 
upstream medium scatters the particles back towards the shock front, lead- 
ing to further acceleration. This is an example for the non-linearity of pro- 
cesses at a shock. 

• Stochastic acceleration is a second-order Fermi process during which the 
particle gains energy due to scattering in the downstream turbulence. 
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7.6 Particles at Shocks in Interplanetary Space 

In Fig. 7.4 we saw a particle event consisting of both a flare-accelerated 
component and a particle component accelerated at the shock in front of 
the CME. Let us now take a look at the particle distribution right at an 
interplanetary travelling shock. 

Figure 7.22 shows the energy spectrum between 200 eV and 1.6 MeV for 
ions in the spacecraft rest frame just downstream of an interplanetary shock. 
The pronounced peak around 1 keV is the solar wind plasma; the power- 
law spectrum for higher energies gives the energetic storm particles. The 
rather smooth transition between these two spectra is in agreement with the 
assumption that the particles are accelerated out of the solar wind plasma. 
While the solar wind spectrum is described by a Maxwellian, the combined 
spectrum can be approximated by a kappa-distribution (5.14). Note the break 
in the power-law spectrum at energies of a few hundred kiloelectronvolts: here 
the spectrum is steeper, indicating a less efficient acceleration. 

It appears reasonable to discuss energetic particles accelerated at inter- 
planetary shocks in two separate energy bands: (a) a low-energy component 




Fig. 7.22. Spectrum of energetic ions from 200 eV to 1.6 MeV at an interplanetary 
shock in the spacecraft’s reference frame. Reprinted from J. Gosling et al. [201], J. 
Geophys. Res. 86, Copyright 1981, American Geophysical Union 
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with ion energies up to a few hundred keV/nucl and electron energies up 
to some tens of kiloelectronvolts; and (b) a high-energy component with ion 
energies in the megaelectronvolts and tens of MeV / nucl range and electron 
energies in the hundreds of kiloelectronvolt to megaelect ronvolt range. One 
reason for such a separation is the break in the ion spectrum, as indicated in 
Fig. 7.22. Another argument stems from the different particle speeds relative 
to the shock: while protons in the tens of kiloelectronvolt range on average 
are only slightly faster than the shock and therefore stick close to the shock 
front, protons in the megaelectronvolt range are much faster and can escape 
easily from the shock. Thus, an observer in interplanetary space at lower 
energies detects the shock-accelerated particles mainly around the time of 
shock passage, while at higher energies particles accelerated at the shock can 
be detected when the shock is still remote from the observer. In addition, 
acceleration is more efficient if particles stay close to the shock and interact 
repeatedly than for particles that escape easily from the vicinity of the shock. 

Particle events associated with interplanetary shocks are reviewed, for 
instance, in [266,271,435]. 

7.6.1 Low-Energy Particles (Tens of keV) 
at Travelling Shocks 

At low energies, three types of particle events can be distinguished, each 
related to another acceleration mechanism (see Fig. 7.23). Nonetheless, as we 
saw in Sect. 7.5, combinations of acceleration mechanisms are possible and 
so too are combinations of different types of events. 

In the left panel of Fig. 7.23 an energetic storm particle event is sketched. 
This is the classical type of particle event at interplanetary shocks. In the 
particle data, it starts some hours prior to shock arrival at the decaying 
flank of the preceding solar energetic particle event. The intensity maximum 
is close to the time of shock passage. Often there is no dip between the 
first maximum and the maximum at the time of shock passage but a more 
plateau-like profile or continuous increase as in the protons in Fig. 7.4. In 
many cases, a few hours after the arrival of the shock, a sudden decrease in 
intensity is observed as the spacecraft enters the ejecta or magnetic cloud, 
the former coronal mass ejection, driving the shock. With increasing energy, 
the intensity increase related to the shock becomes smaller, reflecting the 
rather steep energy spectrum of ESPs compared with SEPs. These events 
are connected with quasi-parallel shocks, and thus the particles are acceler- 
ated by diffusive shock acceleration. The features of the particle event, for 
instance the quasi-exponential intensity increase upstream of the shock and 
the energy spectrum, are in agreement with the predictions of this accelera- 
tion mechanism. In addition, the enhanced upstream turbulence is indicative 
of self-generated turbulence [283,456]. 

At quasi-perpendicular shocks, a different type of particle event can be 
observed, namely the shock spike (middle panel in Fig. 7.23). Here the inten- 
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ENERGETIC STORM SHOCK POST SHOCK 

PARTICLES SPIKES INTENSITY 




REPEATED REFLECTION REPEATED CROSSING a) TRAPPING OF PARTICLES IN 

FROM MOVING SHOCK OF SHOCK FLUCTATIONS BEHIND SHOCK 

{FINITE INJECTION) 
b) FERMI ACCELERATION (SECOND 
ORDER) AT VERY LOW ENERGIES 



Fig. 7.23. Different types of energetic particle events in the tens to hundreds 
of kiloelectronvolt range at interplanetary shocks and the acceleration mechanisms. 
Reprinted from M. Scholer and G. Morfill [464], in Study of travelling interplanetary 
phenomena^ 1977, Air Force Geophysics Laboratory 



sity increase is limited to a few minutes around the time of the shock passage. 
Despite its short duration, the intensity increase can be quite large. Shock 
spikes are explained by shock-drift acceleration. However, to acquire the ob- 
served energies, the particles must cross the shock more than once, and thus 
here shock-drift acceleration works in combination with scattering and not as 
scatter-free shock acceleration. The details of the particle event (single-spike, 
multiple-spikes, etc.) are more variable than in the classical ESP events. This 
structuring, however, is not necessarily related to the acceleration process 
but might also reflect the differences between neighboring field lines or the 
variability of the interplanetary medium on short temporal and spatial scales. 

The third type of event, the post-shock increase, can occur in isolation, 
on the decaying flank of a SEP event or in the wake of an ESP event. In 
the particle data, it can be identified as an abrupt increase in intensity as 
the shock passes the observer. It often ends abruptly as the observer en- 
counters the ejecta. Post-shock intensity increases are generally associated 
with strong turbulence in the downstream medium. This turbulence leads to 
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an efficient stochastic acceleration and a storage of the accelerated particles 
downstream of the shock front. In the low-energy electrons, the post-shock 
increases are the most common type of event; however, in ions and higher 
energetic electrons they are less frequent. 

While the local geometry, the angle 0Bn between the magnetic field direc- 
tion and the flare normal, determines the shape of the shock-related particle 
increase, its size is determined by the compression ratio, the magnetic com- 
pression, and the shock speed vsb = sec 9bu parallel to the magnetic field. 
This latter parameter even seems to distinguish between shocks with little 
or no particle acceleration {vsb < 250 km/s) and shocks that lead to sig- 
nificant particle acceleration [523]. Although vsb is less expressive than the 
shock speed, its physical significance is more important: as the particles are 
bound to travel along the magnetic field line, the shock speed parallel to the 
field determines the ability of a particle to escape from the shock. Even at 
rather slow shocks, a large number of particles might be kept close to the 
shock for further acceleration as long as ^Bn is large, i.e. the shock is quasi- 
perpendicular. The physical significance of the magnetic compression also 
might be simple: the change in B across the shock front creates a magnetic 
mirror for particles moving from the upstream medium towards the shock. As 
these particles are reflected at this mirror, they gain energy. This process is 
faster and more efficient than the reflection in the turbulence downstream of 
the shock as assumed in standard diffusive shock acceleration. However, for 
efficient particle acceleration multiple encounters with the shock are required, 
and thus again the upstream turbulence is the crucial factor. For one particle 
event a complete analysis of particles and waves upstream of the shock has 
confirmed the existence of self-generated turbulence and its good agreement 
with the theoretical predictions [283] . 

7.6.2 High-Energetic Particles (MeVs) at Travelling Shocks 

In megaelectronvolt protons and ions different intensity profiles can be distin- 
guished, too. However, they do not reflect the local acceleration mechanism 
as in the low energies but the location of the observer relative to the shock. 
This is a consequence of the higher speeds of the megaelectronvolt particles 
which allows them to escape from the shock front. An observer in interplan- 
etary space thus samples all particles that the shock has accelerated on the 
observer’s magnetic field line while it propagates outward. The intensity pro- 
file therefore is the superposition of particle injections with different sizes and 
at different positions. 

Figure 7.24 shows sketches of typical intensity profiles of 20 MeV pro- 
tons for different locations at an interplanetary shock, the latter propagating 
downward. The intensity of the first strong intensity increase is largest for 
an observer on the eastern flank. Since these particles arrive very early, they 
must have been accelerated at a time when the shock still was close to the 
Sun. This component can be called a solar component because the particles 
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Fig. 7.24. Representative profiles of about 20 MeV proton events for different 
positions of the observer with respect to the shock. The draping of the field lines 
around the ejecta is only a suggestion. Reprinted from H.V. Cane et al. [85], J. 
Geophys. Res. 93, Copyright 1988, American Geophysical Union 



are accelerated close to the Sun: either by the flare, by the shock, or by a 
combination of both. Going towards the central meridian or the western flank 
of the shock, the intensity of the solar component decreases. The intensity of 
the shock- accelerated component, on the other hand, is largest at the nose 
of the shock at the central meridian while it decreases towards the flanks. 
Note that here shock- accelerated means “accelerated at the shock while it 
propagates outward” . 

Now we can easily understand the different profiles. The observer in panel 
D is located at the eastern flank of the shock and is magnetically connected 
to the flare site. Thus he sees a strong intensity increase early in the event 
due to the solar component. Often the flare- accelerated particles might be 
identified from their charge states and composition (see Fig. 7.4). The local 
acceleration at the shock, however, is rather meager because the observer is 
located only on its flank. Going to the east, observer A is located at the nose 
of the shock but is magnetically connected to a position about 60° west of 
the flare site. The solar component therefore is smaller; however, the particle 
component accelerated at the shock is maximal. Whether the first increase is 
larger than the one close to the shock, as depicted in the figure, or vice versa, 
depends on the properties of the flare, the shock’s acceleration efficiency, 
and the scattering conditions in interplanetary space. Moving farther to the 
west, the solar component decreases or becomes undetectable because of an 
insufficient magnetic connection. The component accelerated at the shock is 
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also smaller than the one at the central meridian; however, it is always larger 
than the solar component. 

Note that Fig. 7.24 is a schematic based on a statistical study. The pattern 
of the changes in the profiles along the shock front have also been confirmed 
by multi-spacecraft observations at individual shocks; however, the variations 
are not necessarily as pronounced as those indicated in Fig. 7.24 [270]. Fig- 
ure 7.25 shows an example of a shock observed by three spacecraft at different 
positions: the top panel shows the geometry, the center of the shock prop- 




September 23, 1978 T/H 

Fig. 7.25. Dependence of the particle profiles on location of the observer from 
multi-spacecraft observations (Helios and IMP); data from the University of Kiel 
particle instrument on board Helios and the Goddard Spaceflight particle instru- 
ment on IMP 
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agating straight downwards. The three spacecraft were separated by almost 
180°. The particle intensities are given for 0.5 MeV electrons (top, 1 indicat- 
ing Helios 1 observations, 2 indicating Helios 2), ~7 MeV protons (middle; 
curve 3 is for IMP), and ~30 MeV protons (bottom); the arrows indicate 
the arrival of the shock. Although the variations with the location of the ob- 
server relative to the shock follow the trend shown in Fig. 7.24, they are much 
smaller than suggested there. Note the post-shock increase in the Helios 2 
data, which is one of the rare occasions of a clear indication of particle stor- 
age and acceleration in the turbulence behind the shock front at the rather 
high energies shown here. 

In addition, we should be aware that all shocks and flares exhibit their 
own features, and therefore the relative intensity between the first intensity 
increase and the hump at the time of shock passage can vary by orders of 
magnitude, even at a fixed location relative to the shock. Thus the typical 
profiles sketched in Fig. 7.24 can also be observed some ten degrees farther 
to the west or to the east. 

As at low energies, the intensity at the time of shock passage is correlated 
with the magnetic compression and the shock speed [267]. However, these 
correlations are rather weak, indicating that other parameters might influence 
the acceleration efficiency too. In addition, we should expect only a weak 
correlation, because the shock parameters are measured locally at the position 
of the observer while the particle event is a superimposition of all injections 
along the observer’s magnetic held line, where the shock parameters of the 
various injections most likely would have been different. 

Note that the details of the particle profile, and, in particular, the relative 
sizes of the solar and shock-accelerated components also vary with energy. 
Figure 7.26 shows proton intensities in five energy channels between 15 MeV 
and 850 MeV as observed by GOES during the Bastille Day event of 14 
July 2000. The vertical lines indicate the arrival of two shocks, the second 
shock belonging to the flare and particle event under study. The highest 




Fig. 7.26. Proton intensi- 
ties between 15 MeV {upper 
curve) and 850 MeV {lower 
curve) observed by GOES 
during the Bastille Day event 
of 14 July 2000. Data courtesy 
NOAA-SPIDR 
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energies (bottom curve) closely resemble the profiles of a solar event (see 
Fig. 7.13), with only a very small increase in intensity around the arrival 
of the shock: the particles must have been accelerated on or close to the 
Sun. With decreasing energy (going upwards from curve to curve), the bump 
of ESP particles becomes more pronounced; at about 20 MeV the intensity 
continues to rise from the onset until the shock arrives. In the latter case, 
particle acceleration at the shock in interplanetary space is dominant, and 
particles accelerated at or close to the Sun play only a minor role. Although 
the relative sizes of the solar and shock-accelerated particle components are 
different, this energy dependence is similar in all events. 

The observations clearly show that megaelectronvolt protons are accel- 
erated at interplanetary shocks. But how? Of the acceleration mechanisms 
discussed in Sect. 7.5, only diffusive shock acceleration might to be able to 
explain the observations. Stochastic acceleration in the turbulence behind 
the shock appears to be too slow. In addition, it would keep the particles 
confined in the downstream medium, leading to a post-shock increase. The 
latter can be observed in megaelectronvolt protons, although not frequently. 
Shock-drift acceleration also is not very likely because its characteristic fea- 
ture, the shock spike, is not observed. In addition, it is debatable whether 
under interplanetary conditions SDA will be able to accelerate protons to 
megaelectronvolt energies. 

But the explanation of the observations by diffusive shock acceleration 
also is inconsistent. For instance, if we assume typical scattering conditions, 
the characteristic acceleration time would be much larger than the travel 
time of the shock from the Sun to the observer. If, on the other hand, we use 
the upstream intensity increase to determine A according to (7.47), we find 
mean free paths much lower than those typically observed in interplanetary 
space, even lower than those inferred from the magnetic field fluctuations 
using standard QLT (which always are a lower limit only) . The characteristic 
acceleration times inferred with these As, on the other hand, are still larger 
than the shock’s travel time. Thus the predictions of diffusive shock acceler- 
ation cannot be used to derive a consistent picture. In addition, evidence for 
self-generated waves upstream of the shock still is missing [270]. Nonetheless, 
this does not imply that the particles are not accelerated by this mechanism. 
First of all, all the tests and predictions assume a steady state. Obviously, 
this is not acquired because it would require acceleration times larger than 
the shock’s travel time. Instead, a steady state is acquired up to a few hun- 
dred kiloelectronvolts only, as is also evident in the break in the power-law 
spectrum in Fig. 7.22. But even if a steady state is acquired only up to a 
lower energy, some particles have already been accelerated to much higher 
energies. In addition, the higher energies might require the consideration of 
additional eflFects, in particular particle escape from the shock or the curva- 
ture of the shock front. Thus the acceleration of megaelectronvolt protons 
at interplanetary shocks is still not completely understood. One attempt to 
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gain insight into the problem is the simulation of the observed particle pro- 
files with a model that takes into account the propagation of the shock and 
a continuous, though variable, injection of particles, combined with the sub- 
sequent interplanetary propagation of the particles [222,268,275]. Currently 
the shock is assumed to be a black box, although the inclusion of a real 
acceleration mechanisms, for instance time-dependent diffusive shock accel- 
eration, should be the target. Nonetheless, such a model still yields valuable 
results, in particular the variation of the particle injection from the shock as 
the latter propagates outward can be determined, allowing us to determine 
the change in shock acceleration efBciency with time. 

In the above discussion, we have left open one question: how is the solar 
component accelerated, at the shock or in a flare process? This question is 
hotly debated. Originally [84, 276] the shock-accelerated particles were con- 
sidered as a particle component in addition to the particles accelerated in 
the flare. Later, the current paradigm evolved that in gradual events all par- 
ticles, and therefore also the solar component, are accelerated at the shock 
as it propagates outwards [435]. At present, the pendulum appears to be 
swinging back a little because there are a number of indicators that particles 
accelerated in the flare also contribute to the event observed in interplanetary 
space [271]. (1) The particles accelerated in the flare leave clear signatures of 
their composition and spectra in the form of 7-ray line emission. This is essen- 
tially the same in impulsive and gradual flares: “in both impulsive and gradual 
flares the particles that interact and produce 7-rays are always accelerated 
by the same mechanism that operates in impulsive flares, namely, stochas- 
tic acceleration through gyro-resonant wave particle interactions” [331]. (2) 
In many particle events, the composition evolves from flare-like to shock- 
accelerated during the event, as already indicated in Fig. 7.4 [528]. (3) With 
increasing energy, the charge states more closely resemble those in flares than 
those in the corona [105,339,360]. 

This discussion is important insofar as it is directly related to the injection 
problem: if particles from a flare are already present, acceleration by the shock 
must work on a preaccelerated population. In addition, the shock must only 
reaccelerate particles, but not accelerate particles out of the solar wind - 
thus, to explain the observations, an energy gain by a factor of say five or 
an order of magnitude might be sufficient, instead of the several orders of 
magnitude required in the case of acceleration out of the solar wind. There 
is also observational evidence for reacceleration: enrichments in ^He, which 
are believed to be a typical signature of acceleration in the flare process, of 
between 3 and 600 times the background ratio, have been found at shocks 
in EPS events, with the probability for a given particle event at a shock 
to be ^He-rich increasing with increasing solar activity, that is, when more 
^He from earlier flares is in interplanetary space [130]. Thus particles from 
previous impulsive events are still around as the shock passes by and form a 
seed population: residual ^He and Fe ions from impulsive solar flares can fill 
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a substantial volume (>50%) of the in-ecliptic interplanetary medium during 
periods of high solar activity [336]. 

7.6.3 Particles at CIR Shocks 

Particles observed at corotating interaction regions are also accelerated at 
shocks. As in the case of travelling shocks, particles and shocks can be ob- 
served. Thus the similarities between the two particle components are: (a) 
particles are accelerated at shocks (acceleration process), (b) particles are 
accelerated out of a seed population (injection problem), and (c) particles 
are accelerated remotely from the observer (propagation problem). How- 
ever, there are also some differences between the two populations of shock- 
accelerated particles, see Table 7.3. 

The most important features of particle events in association with CIR 
shocks are (see [335]): (1) the maximum particle intensities are observed at 
the reverse shock at ~4 AU and 20° heliolatitude, (2) the particle flow at 1 AU 
is mostly sunward with occasional significant non-field- aligned anisotropies, 

(3) low-energy ions can be observed at 1 AU even if the shock is not observed, 

(4) the composition of the ions resembles that of the solar wind except for 
enhancements in He and C relative to O, and (5) the particle ratios He/0, 
C/0, and Ne/0 increase with increasing speed of the high-speed stream. 
Basically, these observations fit into the same picture as used in the current 
paradigm of particle acceleration at travelling interplanetary shocks: particles 



Table 7.3. Comparison between particles accelerated at travelling shocks and at 
CIR shocks 

Particles accelerated at CME-driven Particles accelerated at CIR shocks 
shocks 



Particle energies up to some 100 MeV 
(protons) and some MeV (electrons). 

Particles stream away from the shock 
(the sign of anisotropy changes as the 
shock passes by). 

Most efficient acceleration close to the 
Sun. 

At low energies mainly, solar wind com- 
position but there are exceptions (a) 
for individual events, (b) in energy, (c) 
with time. 

Particles can be observed even if the 
shock is not observed. 

No evidence for pickup of anomalous 
cosmic rays. 



Particle energies limited to about 
10 MeV (protons) and 200 keV (elec- 
trons) . 

At 1 AU, particles stream towards the 
Sun. 

Most efficient acceleration at about 
4 AU. 

Nearly solar system composition, ex- 
cept for enhanced He and C relative to 

O. 

Low energy ions observed at 1 AU or 
high latitudes in the absence of shocks. 
Singly charged He indicates pickup of 
ACRs. 
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are accelerated by the shock (acceleration mechanism) out of the solar wind 
(source population and injection) and propagate almost scatter- free (propa- 
gation) except in the vicinity of the shock, where strong scattering is required 
for efficient particle acceleration. 

The difference in the particle events at CIR shocks and travelling shocks, 
as outlined in Table 7.3, results from the different properties of the shocks: 
while travelling shocks are strong, often high-speed shocks with travel times 
between the Sun and the Earth of about 2 days, CIR shocks are rather weak, 
slowly moving shocks with lifetimes of many solar rotations. In addition, 
while the geometry is quasi-parallel in travelling shocks, at least close to the 
Sun, in CIR shocks it is quasi-perpendicular rather than quasi-parallel. Thus 
the boundary conditions for the acceleration mechanism in travelling and 
CIR shocks are quite diflFerent. In particular, the main problem for travelling 
shocks, the small time span available for acceleration, does not apply to CIR 
shocks. Thus it is even more tempting to assume the paradigm of shock 
acceleration out of the solar wind to be valid, as discussed in [435]. 

Nonetheless, as in the case of travelling shocks, recent observations pose 
challenges to this paradigm. A major challenge is the observation of singly 
charged ions in the particle population accelerated at CIR shocks. These 
pickup ions, which have entered the heliosphere as neutrals and are ionized 
by the Sun’s hard electromagnetic radiation on their way towards the inner 
heliosphere, are picked up for acceleration with much higher efficiency than 
the ambient solar wind [189,340]. 

7.6.4 Particles at Planetary Bow Shocks 

Particles and waves can also be observed upstream of planetary bow shocks 
which form when the supersonic solar wind hits the obstacle magnetosphere 
and is slowed down to subsonic speed. Basically, the bow shock is parabolic 
and symmetric around the Sun-Earth line. At the Earth’s position, the in- 
terplanetary magnetic field spiral has an angle of 45° with respect to the 
Sun-Earth axis. Thus along the bow shock, the local geometry Obu is highly 
variable (see Fig. 7.27), ranging from quasi-perpendicular close to the nose 
of the bow shock to quaisi-parallel at the dawn side of the magnetosphere. 

Upstream of the bow shock, a foreshock region develops which is char- 
acterized by energetic particles streaming away from the shock front and by 
waves excited by these particles. Depending on the local geometry dif- 
ferent particle distributions and different types of waves can be observed. 
Close to the nose of the shock, the subsolar point, the geometry is quasi- 
perpendicular and shock-drift acceleration leads to a particle distribution 
in the form of a rather narrow beam of reflected ions, see the left panel of 
Fig. 7.28. The spiky peak in the middle of the distributions is the solar wind 
while the broader peaks are reflected and accelerated ions. The wave field 
consists of low- amplitude waves with frequencies of about 1 Hz. 
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Fig. 7.27. Geometry of 
the bow shock and waves 
in the upstream and 
downstream media. The 
different ion distributions 
are indicated 



At the dawn side the geometry is more quasi-parallel and particles are ac- 
celerated by diffusive shock acceleration. The resulting particle distribution 
is a diffusive one, forming a ring around the solar wind peak, as is evident 
in the right panel in Fig. 7.28. The waves excited by these ions again are 
low-frequency waves with larger amplitudes, occasionally containing shock- 
lets in the sense of discrete wave packets which often are associated with 
discrete beams of particles. The relationship between particles and waves 




Fig. 7.28. Particle distributions upstream of the Earth’s bow shock. In the beam 
distribution (left) particles propagate from the shock front towards the solar wind 
only while the diffusive population (right) covers all pitch angles. In both panels, 
the sharp peak in the middle is the solar wind incident on the bow shock. Reprinted 
from G. Paschmann et al. [401], in J. Geophys. Res. 86, Copyright 1981, American 
Geophysical Union 
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can be described in the model of coupled MHD wave excitation and ion ac- 
celeration [312]; see Sect. 7.5.3. 

In between, the local geometry is oblique and both mechanisms contribute 
to the particle acceleration. The resulting particle distribution is an interme- 
diate distribution in which both the reflected beam and the more diffusive 
population can be found. The accompanying wave held basically consists of 
transverse low-frequency waves. 

The electrons form a foreshock region, too. It starts close to the nose of 
the magnetosphere (see Fig. 7.27): at the nose of the bow shock, the geometry 
is quasi-perpendicular and particles are accelerated by shock-drift accelera- 
tion. Since the gyro-radii of the electrons are much smaller than those of the 
protons, their drift path along the shock front is shorter, leading to an earlier 
escape and therefore a more extended foreshock region. 

Compared with the particle populations observed at interplanetary shocks, 
the bow shock particles, although signiflcantly more energetic than the solar 
wind, are still low-energy particles. Electron acceleration is observed only up 
to a few kiloelectronvolts, ion acceleration up to some tens of kiloelectron- 
volts. Waves and particles upstream of bow shocks cannot only be found on 
Earth but also on other planets [386,451,454]; see Sect. 9.4.4. 



7.7 Galactic Cosmic Rays (GCRs) 

Galactic cosmic radiation is incident on the solar system isotropically and 
constantly. It basically consists of hydrogen, helium, and electrons, but also 
heavier ions such as C, O, and Fe. Energetically, the galactic cosmic radiation 
starts where the spectrum in Fig. 7.2 ended, at some tens of MeV/nucl. 
The energy spectrum has a positive slope, i.e. the intensity increases with 
increasing energy, up to some hundreds of megaelectronvolts (see Fig. 7.29). 
At higher energies, the spectrum has a slope of —2.5. Very few of the GCRs 
can have energies up to 10^^ eV, corresponding to about 20 J (that is the 
kinetic energy contained in an apple of 200 g moving at a speed of 50 km/h). 
GCRs hit the Earth at a rate of about 1000/ (m^s). 

7.7.1 Variations 

The intensities of galactic cosmic rays vary on different temporal and spatial 
scales. Some of these variations will be discussed here. 

Modulation with the Solar Cycle. At energies below a few gigaelectron- 
volts, the GCR intensities show a strong dependence on solar activity with 
a maximum during the solar minimum (see Fig. 7.30). This effect is called 
modulation. With increasing solar activity, the maximum of the energy spec- 
trum shifts towards higher energies. At proton energies of about 100 MeV 
the modulation is maximal, while at energies of about 4 GeV the modulation 
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Fig. 7.29. Spectrum of galactic 
cosmic radiation. At energies below 
some GeV the three curves for each 
particle species indicate the spec- 
tra for solar minimum {upper curve), 
average {middle curve), and solar 
maximum conditions {lower curve). 
Reprinted from Physics Today 27, P. 
Meyer et al. [357], Copyright 1974, 
with kind permission from the Amer- 
ican Institute of Physics 



only is 15%-20%. Up to about 10 GeV galactic electrons show a spectrum 
similar to the protons. They also show a modulation with solar activity in 
the energy range 0.1 to 1 GeV. 

Forbush Decreases. Interplanetary shocks not only accelerate energetic 
particles, they also block part of the galactic cosmic radiation in the hun- 
dreds of megaelectronvolt to gigaelectronvolt range. This is called a Forbush 
decrease. Forbush decreases occur in two steps [28, 162]: first, there is an 
intensity decrease as the shock passes by, followed by a more pronounced 
decrease as the observer enters the ejecta driving the shock [86]. Typical val- 
ues at 500 MeV are about 2 % for the shock decrease and about 5% for the 
decrease related to the arrival of the ejecta. While both decreases are rather 
abrupt (within minutes), the recovery phase is long and can last for days. 
Two examples are shown in Fig. 7.31. 

Decreases in the galactic cosmic radiation are not only found at travelling 
interplanetary bow shocks but also at the shocks at corotating interaction 
regions. Amazingly, the modulation of GCRs by CIRs continues even to lat- 
itudes well above the streamer belt where the CIRs form: while the plasma 
instruments on Ulysses could neither detect shocks nor the changes in solar 
wind effects, the particle instruments still detected the recurrent modulation 
of GCRs associated with CIRs that had formed at lower latitudes [485]. 
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Fig. 7.30. Yearly running averages of Mount Washington neutron monitor GCR 
intensities {dashed) and monthly sunspot numbers (solid) from 1954 to 1996 [327]. 
Note the reversed scale in sunspot numbers: sunspot numbers are high during GCR 
minima and low during GCR maxima. Reprinted from J.A. Lockwood and W.R. 
Webber, J. Geophys. Res. 102, Copyright 1997, American Geophysical Union 



CR-B Relation. When a corotating interaction region collides with a trav- 
elling interplanetary shock or another corotating interaction region, it forms 
a merged interaction region (MIR). Three types of MIRs can be distin- 




Fig. 7.31. Two-step Forbush decreases. The smooth line suggests the shape of 
the Forbush decrease after subtraction of the local ejecta effect (shaded). The bot- 
tom panel gives the standard deviation of counting rates [561]. Reprinted from G. 
Wibberenz, in L.A. Fisk, J.R. Jokipii, G.M. Simnett, R. von Steiger, and K.-P. 
Wenzel (eds.). Cosmic rays in the Heliosphere, Copyright 1998. Kluwer Academic 
Publishers 
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guished [73]: global merged interaction regions (GMIRs), corotating merged 
interaction regions (CMIRs) and local merged interaction regions (LMIRs). 

Global merged interaction regions (GMIRs) are shell-like structures with 
intense magnetic fields extending around the Sun up to high latitudes. They 
originate in the interactions of transient and corotating MIRs and produce 
step-like intensity decreases in galactic cosmic rays throughout the helio- 
sphere which, in turn, produce most of the modulation [404,418]. GMIRs are 
long-lived (1.5-1. 8 years), they might even extend over the poles. 

Corotating merged interaction regions (CMIRs) are MIRs with spiral 
forms associated with the coalescence of two or more corotating interaction 
regions. CMIRs and rarefaction regions generally produce several successive 
decreases and increases in GCRs over several months while the background 
intensity stays roughly constant. They do not lead to an appreciable net 
modulation. 

Local merged interaction regions (LMIRs) are non-corotating MIRs with 
a limited longitudinal and latitudinal extend. Most likely, they are formed 
by interactions among transient and perhaps corotating flows. Their effect 
on galactic cosmic rays is local, comparable to the typical Forbush decrease 
observed on Earth. 
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Fig. 7.32. Daily averages of the magnetic field strength normalized by the Parker 
magnetic field strength Bp and cosmic ray intensity of > 70 MeV protons [73]. 
Reprinted from L.F. Burlaga, J. Geophys. Res. 98, Copyright 1993, American Geo- 
physical Union 
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Figure 7.32 shows daily averages of the intensities of > 70 MeV/nucl 
galactic cosmic rays and the magnetic field from the beginning of 1986 to the 
end of 1989 for Voyagers 1 and 2. Note that these intensities are measured 
on an outward propagating spacecraft, thus the long-term variations include 
effects due to the radial gradient (see below). The most important results are: 
(1) GCRs decrease when a strong GMIR moves past the spacecraft (time pe- 
riods D and D'). (2) GCRs tend to increase over periods of several months 
when MIRs are weak and the strength of the magnetic field is relatively low 
(R and R'). (3) GCRs fiuctuate about a plateau when MIRs are of interme- 
diate strength and are balanced by rarefaction regions (time period P when 
CMIRs passed Voyager 1). Some local merged interaction regions (L1-L3) 
produce step-like intensity decreases; however, they are observed locally on 
one Voyager spacecraft only. 

From the close correlation between increasing magnetic field strength and 
decreasing cosmic ray intensity, Burlaga et al. [72] suggested an empirical 
relation, the CR-B relation, between the change in GCR intensity J and the 
magnetic field strength B relative to the Parker value Bp: 



^=-^1 

d^ ' 


V^p / 


for B > Bp 


(7.50) 


d^ 




for B < Bp 


(7.51) 



with D and R being constant. 

Radial Gradients. Spatial variations in cosmic ray intensity can be repre- 
sented as 

j dJ = dr + g\ dA (7.52) 

with the local radial (latitudinal) intensity gradient {g\) defined as 

Actual measurements are made between two often widely separated space- 
craft, thus only an average, non-local gradient can be determined. Data sam- 
pled over successive solar minima by the Pioneer and Voyager spacecraft indi- 
cate that the radial gradient might be a function of heliospheric distance [174] 

9r = Gor^ . (7.54) 

Both Go and a show a rather complex dependence on time [174]. General 
features of these variations are: at solar maxima radial gradients are larger 
than at solar minima and there is a significant change in the radial dependence 
of gr, in particular, a might change sign during the solar cycle. At solar 
minima there is a strong decrease in gj- with increasing r and the magnitude 
of pr is appreciably larger in qA < 0 (1981) than in > 0 epochs (1977). 
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Latitudinal Gradients. Latitudinal gradients are less well studied than 
radial ones: Ulysses is the first spacecraft to reach heliographic latitudes of 
80°, the only other spacecraft outside the ecliptic plane is Voyager 1 at ~ 34°. 

Figure 7.33 shows a comparison of IMP and Ulysses > 106 MeV counting 
rates between early 1993 and the end of 1996. Ulysses radial distance and 
heliographic latitude are given at the bottom of the figure, IMP is at 1 AU in 
the ecliptic plane. From the IMP data, the recovery of OCRs in the declining 
phase of solar cycle 22 is evident. Owing to the spacecraft orbit, galactic cos- 
mic rays on Ulysses show a more complex time development: at the beginning 
of the time period under study Ulysses slowly moves inward and to higher 
latitudes, passing the Sun’s south pole in fall 1994 at a maximum southern 
latitude of 80°S at a radial distance of 2.3 AU. Within 11 months, Ulysses 
performs a fast latitude scan up to 80° N. Afterwards, Ulysses descends slowly 
in latitude and moves outwards. During the fast latitude scan Ulysses crosses 
the ecliptic plane at a radial distance of 1.3 AU. At this time, GCR intensities 
on both spacecraft agree, their difference is largest when Ulysses is over the 
poles. The fast latitude scan is most suitable to study latitudinal gradients 
because the radial distance of Ulysses does not vary strongly (thus results 
are not affected by radial gradients) and the time period is rather short and 
GCR intensities at the Earth’s orbit are roughly constant, indicating that 
temporal variations are negligible too. 

From the data in Fig. 7.33 a latitudinal gradient of ~ 0.3%/degree can 
be determined [221]. However, significant latitudinal effects are only observed 
when Ulysses is totally embedded in the high-speed solar wind streams of the 
coronal holes. As long as fast and slow solar wind streams can be observed, 
the latitudinal gradient vanishes. The latitudinal variation in galactic cosmic 




Fig. 7.33. Twenty-six day running mean quiet time counting rates of > 106 MeV 
proton observed by KET on Ulysses and by the UoC Instrument on IMP 8 from 
1993 to the end of 1998 [221]. The fluctuations are caused by CIRs. Reprinted 
from B. Heber et al., J. Geophys. Res. 103, Copyright 1998, American Geophysical 
Union 
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rays is not symmetric around the heliographic equator but has an offset 
to 7-10°S [220,488], indicating an offset of the heliospheric current sheet 
towards the south. This offset is confirmed by the solar wind and magnetic 
field observations (Sect. 6.5). 

Electrons. Most information about galactic cosmic rays are obtained for 
nuclei, electron observations are rather sparse [148]. In general the modula- 
tion of cosmic ray electrons is similar to that of nuclei, although some clear 
differences exist: (a) the slope of the electron spectrum below 100 MeV is neg- 
ative because it is dominated by Jovian electrons; and (b) the ratio between 
electrons and helium strongly depends on the polarity of the solar cycle [181], 
indicating a charge dependence of modulation. 

7.7.2 Modulation Models 

The galactic cosmic ray intensity is anti-correlated with the sunspot num- 
ber and thus solar activity. But how can we understand this behavior? The 
galactic cosmic rays have to propagate from the heliopause towards the inner 
heliosphere and it appears that during solar maximum fewer particles manage 
this task. What is hindering them and which forces do they experience? 

Like solar particles, galactic cosmic rays basically experience the Lorentz 
force: they travel field parallel, they can drift in the large-scale structures 
of the field, and they are scattered at the magnetic field irregularities. In 
addition, the particles experience convection and adiabatic deceleration and 
they are blocked and refiected at transient inhomogeneities such as magnetic 
clouds and shocks. With increasing solar activity the structure of the in- 
terplanetary field changes: first, the tilt angle increases, leading to a wavier 
heliospheric current sheet (see Fig. 6.20); second, shocks and the ejecta driv- 
ing them disturb the interplanetary medium. 

Diffusion. Galactic cosmic rays propagate inwards into the solar system 
and we are basically interested in their advance by a piece Ar in radius, 
although the actual motion is a gyration around the field line. For solar 
energetic particles observations indicate that diffusion perpendicular to the 
field is negligible (Sect. 7.3). This partly is due to the fact that the magnetic 
field line is almost radial close to the Sun. At large heliocentric distances, 
the field line is tightly wound up and therefore almost circular around the 
Sun. Thus, even if the diffusion coefficient kj_ perpendicular to the field is 
much smaller than that parallel to the field, the net transport in the radial 
direction might be more efficient for particles crossing from one field line to 
a neighboring one instead of moving all the way along the field line. 

The perpendicular diffusion coefficient depends on the particle speed, the 
Larmor radius and the perpendicular mean free path A_l [166]: 

_ vri^ h/'^L 
3 l + (Ax/rL)2 



(7.55) 
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or for Aj_ ^ rp: 



_ rp _ ... 

3 Ax ’ ( ■ ) 

Parallel and perpendicular diffusion coefficients can be combined to yield a 
radial diffusion coefficient /^rr 



i^TT = ^11 cos^ 'ip sin^ -0 
where 0 is the spiral angle. 



(7.57) 



Drifts. Owing to their high speeds, galactic cosmic rays have large gyro- 
radii of the order of AU. Thus, they can drift in the large-scale structure of 
the heliosphere (see Fig. 7.34). In the heliosphere, the most important drifts 
are curvature drift (2.55) and gradient drift (2.54). The latter is extremely 
efficient along the heliospheric current sheet (see Fig. 2.2). Drifts in the mean 
Archimedian spiral pattern can be characterized by a drift velocity [391] 
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(7.58) 



Drift then results in a convection of particles with the drift velocity vb in the 
transport equation (7.60). Drift speeds can be several times the solar wind 
speed; thus, drifts can by far exceed convection with the solar wind [258]. 

In contrast to diffusion and convection with the solar wind, drift depends 
on the polarity of the magnetic field: drift directions are reversed when the 
magnetic field polarity is reversed. In addition, drifts are charge-dependent 
with electrons and nuclei drifting into opposite directions. Thus, the consid- 
eration of drift effects in modulation models leads to a charge dependence 
and different features in cycles with opposite magnetic field polarities. 




Fig. 7.34. Simplified sketch 
of the heliosphere and the 
energetic particles. In reality, 
the heliosphere has a shape 
similar to the magnetosphere 
only in that the deformation 
is due to the interstellar wind. 
Note the different shapes of 
magnetic field lines originat- 
ing from the equatorial and 
the polar regions 
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Formally, the drift also can be included in the antisymmetric terms of 
the diffusion tensor 

/ /^,| 0 0 \ 

K = I 0 K± I . (7.59) 

\ 0 Kj_ J 

The drift speed can be obtained as the divergence of the antisymmetric part. 

Transport Equation. The development of the cosmic ray density U over 
time can be described by a transport equation [394, 396] 

^ = V ( k * VC) - (Vsowi + Vj,) • VC + 1 V . (7.60) 

Here T is the particle kinetic energy, a — (T + 2mo)/(T + mo) with mo being 
the particle rest mass, k® the symmetric part of the diffusion tensor, and t;D 
the drift velocity. The terms on the right-hand side then give the diffusion 
of particles in the irregular magnetic field, bulk motion due to the outward 
convection with the solar wind and particle drifts, and adiabatic deceleration 
resulting from the divergence of the solar wind flow. Note that no effects of 
transient disturbances are included, thus (7.60) should be applied only during 
the relatively undisturbed conditions during solar minima. 

The Modulation Parameter. The first attempts in modeling modulation 
reduced (7.60) to a simple diffusion-convection equation. For quasi-stationary 
conditions dUfdt 0, a roughly isotropic cosmic ray flux and a spherical- 
symmetric geometry, a modulation parameter ^ could be determined [187] 



f t^sowidr 

J 3/^(r,P) 



(7.61) 



with r the radius at which the observer is located, R the outer boundary 
of the modulation region and /^(r, P) the diffusion coefBcient. Physically, ^ 
roughly corresponds to the average energy loss of inward propagating parti- 
cles due to adiabatic deceleration. This energy loss might be several 100 MeV 
for particles travelling from the outer boundary to 1 AU. Thus, particles with 
energies below some hundred MeV / nucl in the interstellar medium are com- 
pletely excluded from the vicinity of Earth. Or, in other words: near-Earth 
observations at energies below a few hundred MeV/nucl do not provide any 
information regarding the spectrum of the local interstellar medium at these 
energies. The part of the interstellar spectrum blocked by modulation is not 
negligible, it contains about 1/3 of the GCR pressure or energy density. 



The Importance of Drifts. Although drifts were included in the original 
transport equation (7.60), they were generally neglected until in the late 
1970s Jokipii et al. [258] pointed out that the inclusion of drift effects may 
profoundly alter our picture of modulation. 
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In the heliosphere, the following drift pattern arises: In an > 0 cycle 
(the Sun’s magnetic field in the northern hemisphere is directed outwards, 
the configuration in the 1970s and 1990s) positively charged particles drift 
inwards in the polar regions, downward to the heliomagnet ic equator, and 
outward along the neutral sheet. The sense of drift is reversed for an A < 0 
cycle. At the termination shock there is a fast drift upward along the shock. 
Note that drift itself does not cause modulation but only changes the path 
along which particles enter the heliosphere. Modulation itself can only happen 
if also transient disturbances and MIRs are present or if the tilt angle changes, 
which in turn alters the drift path of the particles: with increasing tilt angle, 
the waviness of the heliospheric current sheet increases and the drift path in 
the current sheet becomes longer. 

The relative roles of drift and diffusion are crucial for our understanding 
of modulation. For typical conditions, diffusion dominates drift on small time 
scales. On longer time scales, however, drift effects can accumulate and there- 
fore can become important compared with diffusion. Two conditions have to 
be fulfilled: (a) noticeable effects of drift can only be expected if the particle 
spends enough time in the heliosphere to drift at least a significant portion 
of 7t/2 in latitude, (b) Perpendicular diffusion should not be too strong to 
wash out the drift pattern. If perpendicular diffusion was too strong, particles 
would not drift along the polar axis or neutral sheet but would spread in lat- 
itude. This spread depends on the ratio k±/k7-: if k,± drift dominates, 

while for <C k,± diffusion destroys the drift pattern. For intermediate 
cases, both effects have to be considered. 

The inclusion of drift effects in the transport of cosmic rays leads to the 
following consequences [416,417]: 

• A polarity-dependent 11-year cycle with a pronounced maximum m a, qA < 
0 cycle and a flat plateau-like maximum maqA> 0 cycle, which is observed 
(see Fig. 7.30). 

• A correlation of modulation with the tilt angle in qA < 0 cycles only when 
positively charged particles travel inwards along the heliospheric current 
sheet and a larger tilt angle automatically implies a longer drift path. 

• A charge asymmetry which implies differences in, for instance, electron to 
helium ratios in different polarity cycles. 

Since these features are observed, the importance of drifts is without doubt. 
The details of the modulation process and the relative importance of the 
different processes, however, are still subject to debate. A state-of-the-art 
review with many accompanying papers on detailed problems can be found 
in [47,160]. 
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7.8 What I Did Not Tell You 

Some of the simplifications we have encountered in this chapter are conse- 
quences of problems already mentioned in Sect. 6.9. For instance, we show 
a sketch of the shock and the ejecta in Fig. 7.24, but our idea is based on 
one-point observations only: neither have we measured the shock parameters 
at a fixed time at different positions along the shock front nor have we ob- 
served the entire shock at different times. At most, we can have observations 
from a few points at different times and positions, as suggested in Fig. 7.25. 
Thus we do not even know whether the shock front is smooth and continu- 
ous, as suggested in the figure, or whether it more closely resembles a Swiss 
cheese, with holes and excursions depending on the varying properties of the 
upstream interplanetary medium. 

And we again encounter the problem of classification. For instance, from 
Table 7.2 we learn that impulsive events are electron-rich and gradual events 
are proton-rich, or, in other words, that the electron-to-proton ratio is higher 
in impulsive than in gradual events. But we should be careful, and look into 
the data [84,272]: it is not meant that the e:p ratio in each impulsive event 
is higher than that in any gradual event. Instead, it is meant that on average 
the e:p ratio is higher in impulsive than in gradual events. Thus, instead of 
two distinct distributions for e:p, we find a big overlap in the distributions 
for the two classes and only the averages are different; see Fig. 7.35. Again 
this situation is fairly similar to that for the height distribution of males 
and females: if we pick out one event, we cannot tell from the e:p ratio 
whether it is impulsive or gradual, just as we cannot tell from the height of 
an individual whether that person is male or female. And if we pick out one 
impulsive and one gradual event, the one with the larger e:p ratio might be 
the gradual one. This not only points to the problem of a classification based 
on phenomenological criteria, as already described in Sect. 6.9, but also gives 
a constraint on model development: any model to describe the differences in 
particle events from impulsive and gradual flares must account not only for 
the difference in one property but also for the large scatter in the properties 
in both classes. 




Fig. 7.35. The properties of gradual and 
impulsive SEP events are different on aver- 
age, but the distributions for the two classes 
of events scatter over a broad range and 
overlap 
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Like the plasma observations, particle observations are one-point in situ 
measurements. The plasma is convected over the observer with the solar wind, 
that is, its motion relative to the observer is rather regular and simple. As 
we have already seen in the discussion of upstream turbulence generated by 
energetic particles, particles are fast and, owing to scattering, travel back and 
forth. That is the reason why particles can be used as probes of the structure 
of the interplanetary medium, for instance when we study scattering condi- 
tions. However, this running ahead also implies that a particle event might 
be influenced by a structure in the solar wind from a much earlier event. This 
is not considered in our present methods when interpreting particle events: 
normally, we relate the intensity profile to a parent flare and the accompany- 
ing CME and shock. But this picture might be oversimplified. For instance, 
as early as in the 1970s Levy et al. [322] suggested that extremely large par- 
ticle events might result from a fast shock, accompanying the flare and event 
under consideration, running towards a slower shock from an earlier event. 
In that picture, particles are trapped between the shocks and are accelerated 
as the distance between the shocks decreases because of the second adiabatic 
invariant. Such a scenario can be applied to many of the larger events [273]; 
modeling now considers even the CMEs [274]. The Bastille Day event in 
Fig. 7.26 is one likely candidate for such an event: the first shock in the rising 
phase is from an earlier event and might act as the barrier that prevents 
particles from escaping to larger distances and might even lead to further 
acceleration by a Fermi I process. On a shorter time scale and smaller spatial 
scales, the interaction between CMEs has also been discussed as a possible 
requirement for efficient particle acceleration: CMEs can catch up with each 
other even in the field of the coronograph. These cannibalizing CMEs [197] 
show some peculiarities in the plasma parameters which can be observed in 
situ in the interplanetary medium, and it appears that cannibalizing CMEs 
are more efficient in particle acceleration than single CMEs [198], although 
the detailed mechanisms are not understood yet. 



7.9 Summary 

Energetic particles in interplanetary space originate from various sources, 
such as planetary bow shocks, travelling and corotating shocks, and solar 
flares, or from outside the solar system. The corresponding populations have 
characteristic spectra, compositions, and time profiles, providing information 
about the acceleration and propagation mechanism. For almost all particle 
populations, shock acceleration is important; in solar flares, reconnection and 
selective heating are also at work. The particles, in particular solar energetic 
particles, can also be used as probes of the magnetic structure of the inter- 
planetary medium and the superimposed turbulence. 
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Exercises and Problems 

7 . 1 . Determine the Larmor radius, gyro-period, and speed of galactic cosmic 
rays with an energy of 10 GeV in a 5 nT magnetic field. Compare with the 
same values for a solar proton with an energy of 10 MeV. Determine the 
travel time between the Sun and the heliopause at 100 AU for a straight 
path and a path following an Archimedian magnetic field line. 

7 . 2 . Assume a Galton board with n rows of pins. For each pin, the possibility 
of a defiection to the left or right is 0.5. (a) Give the probability distribution 
in the nth layer, (b) Show that for large n this distribution converges toward 
the bell curve. Give the standard deviation, (c) Write a small computer pro- 
gram to simulate a Galton board. Compare the runs of your simulation with 
the expected result for a different number of rows. Alternatively, simulate 
the results for a Galton board with 5 rows and 100 balls by tossing a coin. 
Compare with the expected results. 

7 . 3 . Get an idea about changes of time scales in diffusion. Imagine a horde of 
ants released at time to = 0 onto a track in the woods. The speed of the ants is 
1 m/min, their mean free path 10 cm. How long do you have to wait until the 
number of ants passing your observation point at 1 m (10 m, 100 m) is largest? 
How do your results change if you get faster ants (10 m/min, 50 m/min) or 
ants moving more erratically (mean free paths reduced to 5 cm, 1 cm). Can 
you imagine different populations of ants characterized by different speeds 
and different mean free paths reaching their maximum at the same time at 
the same place? (More realistic numbers for interplanetary space: particle 
speeds of 0.1 AU/h, 1 AU/h, and 6 AU/h, distances of 0.3 AU, 1 AU, 5 AU, 
and mean free paths of 0.01 AU and 0.1 AU). 

7 . 4 . In interplanetary space, propagation should be described by the diffusion- 
convection equation instead of a simple diffusion equation. The fiow speed 
of the solar wind is about 400 km/s. Calculate profiles with the diffusion- 
convection equation with the numbers given in the parentheses for Problem 
7.3. Compare with solutions of the simple diffusion model. Discuss the dif- 
ferences: how do they change with particle speed and mean free paths and 
why? (Note: Solving this problem you should get an idea about the infiuence 
of convection. And this infiuence is quite similar when additional processes 
in the transport equation are considered too.) 

7 . 5 . Explain the shape of k{/i) in Fig. 7.11 for isotropic scattering. Why is it 
not a straight line? 

7 . 6 . Shock acceleration is important for many of the particle populations dis- 
cussed in this chapter. Describe them and find arguments for the differences, 
in particular the maximum energy gained by the different populations. 
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7 . 7 . In Fig. 7.4 the composition slowly evolves from one characteristic of 
flare acceleration to another one characteristic of shock acceleration. Can you 
explain this slow evolution in terms of a 5-like solar acceleration, a continuous 
acceleration of particles at the shock, and interplanetary propagation? 

7 . 8 . An interplanetary shock propagates with a speed of 800 km/s in the 
space craft frame into a solar wind with a speed of 400 km/s. The ratio 
of upstream to downstream flow speed in the shock rest frame is 3, and 
the upstream diffusion coefficient is 10^^ cm^/s. Determine the characteristic 
acceleration time. Determine the power-law spectral index for times longer 
than the acceleration time. 

7 . 9 . A shock propagates with a speed of 1000 km/s through interplanetary 
space. The solar wind speed is 400 km/s. The particle instrument on a space- 
craft observes an exponential intensity increase by two orders of magnitude 
starting 3 h prior to shock arrival. Determine the diffusion coefficient in the 
upstream medium (losses from the shock can be ignored, the shock is assumed 
to be quasi-parallel). 

7 . 10 . Perpendicular transport in modulation: compare the travel paths of a 
particle at r = 80 AU if the particle has to follow an Archimedian spiral 
around the Sun for one turn of the spiral and if it travels the same distance 
straight along a radius. 
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The poet’s eye, in a fine frenzy rolling, 
Doth glance from Heaven to Earth, from Earth to Heaven, 
and, as imagination bodies forth 
the forms of things unknown, the poet’s pen 
turns them to shapes, and gives airy nothing 
a local habitation and a name. 
W. Shakespeare, Much Ado About Nothing 



A magnetosphere is shaped by the interaction between a planetary magnetic 
field and the solar wind. The magnetopause is a discontinuity separating both 
fields, forming a cavity in the solar wind. Since the solar wind is a supersonic 
flow, a standing shock wave, the bow shock, develops in front of the mag- 
netopause. In the anti-sunward direction, the magnetosphere is stretched by 
the solar wind, forming the magnetotail. Inside the magnetosphere, different 
plasma regimes exist, dominated by ionospheric plasma in the plasmasphere, 
a highly variable mixture of ionospheric and heliospheric plasma in the geo- 
sphere, and by the solar wind plasma in the outer magnetosphere. These 
different regimes are coupled by fields and currents. Inside the plasmasphere 
energetic particles are trapped in the radiation belts. The inner magneto- 
sphere can be approximated as a slightly distorted dipole held. It is coupled 
to the ionospheric current system, with energy in the form of particles and 
waves exchanged between both regimes. Both ionospheric currents and the 
ring current associated with the radiation belts modify the dipole held. 

Particles and energy are fed into the magnetosphere from different sources: 
(a) the solar wind can penetrate into the magnetosphere due to reconnection 
at the dayside (flux transfer events), convection above the polar cusps, and 
and diffusion into the magnetotail, (b) solar energetic particles can penetrate 
into the magnetosphere at the polar cusps, (c) galactic cosmic rays travel- 
ling along St0rmer orbits even can penetrate down to ground level, and (d) 
plasmas are exchanged between the ionosphere and the magnetosphere. 

The magnetosphere was recognized as a dynamic phenomenon as soon 
as the first systematic magnetic held measurements at the ground became 
available: aside from diurnal variations, strong transient disturbances can 
be observed. These magnetic storms often are accompanied by aurorae and 
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might influence our technical environment, as evident in disruptions in radio 
communication or power-line breakdowns. All these phenomena are caused 
by strong fluctuations or discontinuities in the solar wind and can be related 
to changes in magnetospheric structure, in particular in the magnetospheric 
current system and in the plasma sheet inside the magnetotail. This chapter 
provides an introduction to these phenomena and a supplementary section 
about the aurora and the history of aurora research. 



8.1 The Geomagnetic Field 

Magnetic flelds either originate in currents or from magnetized bodies. The 
Earth is not a magnetized body, as can be seen from the variations in the 
terrestrial fleld, in particular the pole reversals. Instead, the terrestrial held 
originates in a dynamo process similar to the one working inside the Sun. 
Close to the Earth’s surface the fleld can be approximated as a dipole; at 
higher altitudes or under magnetically disturbed conditions it deviates from 
the dipole due to currents, the solar wind pressure, and plasma and fleld 
exchange with the interplanetary medium. A recent review about these geo- 
magnetic flelds and their variability is given in [82,355]. 



8.1.1 Description of the Geomagnetic Field 

To first order, the Earth can be described as a sphere magnetized uniformly 
along its dipole axis. This axis intersects the surface in two points, the austral 
(southern) pole at 78.3°S 111°E close to the Vostok station in Antarctica 
and the boreal (northern) pole at 78.3°N 69° W close to Thule (Greenland). 
Both positions are about 800 km from the geographic poles and the magnetic 
dipole axis is inclined by 11.3° with respect to the axis of rotation. The dipole 
moment Me of the Earth is 8 x 10^^ G cm^ or 8 x 10^^ A m^. 

Geomagnetic Coordinates. The geomagnetic coordinate system is ori- 
ented along the magnetic dipole axis. A plane perpendicular to the dipole 
axis intersecting the center of the Earth defines the equatorial plane. Its 
intersection with the Earth’s surface marks the geomagnetic equator. The 
geomagnetic longitude A and latitude ^ then are defined analogously to the 
geographic longitude A and latitude cp. With (po = 78.3°N and Aq = 291°E 
as the latitude and longitude of the boreal magnetic pole, the magnetic and 
geographic coordinates are related by the transformations 

sin^ = sm(p siiupo -h cos (p cos (/?o cos(A - Aq) (8.1) 



and 



. , cos(psin(A — An) 
sm A = - 



cos^ 



( 8 . 2 ) 
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The magnetic potential at a position r from the Earth’s center is 
_ Ho Me r _ Ho Me sini^ 

^ A A 9 * 

An r'^ An 

From this, the magnetic field strength B = — W can be derived: 

^ 2sin^er 4- cos^e^) . (8.4) 

An 

The fiux density 

B = + = + Ssin^ (8.5) 

falls off with distance as r^. At the Earth’s surface the magnetic field com- 
ponents can be approximated as 

B<p = Be cos ^ and B^ = 2 Be sin ^ (8.6) 



where 



Be 



mpMs 

(47ri?E)^ 



= 3.11 X 10"^ 



T 



(8.7) 



is the equatorial field at the Earth’s surface. At the pole B equals B^ while 
at the equator B equals B^. Thus the magnetic field strength at the pole is 
twice that at the equator. This ratio does not change with distance. 

The geomagnetic field can be described in different systems. In a rectan- 
gular Cartesian system, the triple (A, Y, Z) gives the northward, eastward, 
and vertical components. In a cylindrical system, the triple {D^H^Z) is used 
with Z as the vertical intensity (that is B^), H as the horizontal intensity 
(that is B^), and D as the declination of the field. In a spherical system with 
Z and X as the axes of reference, the field can be described by the triple 
(B, /, D) with total intensity B, inclination /, and declination D. Lines with 
constant declination D are called isogones. Lines with constant inclination I 
are isoclines. The line with / = 0° is the dip-equator or geomagnetic equator. 
The magnetic inclination is tan / = ZjH = B^jB^ — — 2tan^. Thus, close 
to the dip-equator the inclination increases twice as fast as the geomagnetic 
latitude. Figure 8.1 shows the relation between the different systems. 

The magnetic field components discussed so far have been intrinsic be- 
cause their reference direction is the magnetic field itself. The other compo- 
nents are relative ones, their reference direction is the geographic north. The 
declination D is defined as the angle between the magnetic field direction and 
the geographic north: D = Y/X. The northward and eastward components 
of the field then are given as A = H cos D and Y = H sin D. 

The equation of a field line r = r(^) can be inferred from (8.6). The 
magnetic field vector is always tangential to the line of force. The angle a 
between the radius vector and the magnetic field line is given as 
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Fig. 8.1. Components of the geomagnetic field at the 
Earth’s surface 



rd^ 1 
Bj: dr 2tan^ 



( 8 . 8 ) 



Thus we get 



dr ^ ^ ^ 2 sin ^ 

— = 2d^ tan ^ — 

r 



d^ 



d(cos^) 



cos ^ cos ^ 

Integration yields Inr = 21n(cos^) + const, which can be written as 



(8.9) 



r = Teq COS^ ^ , 



( 8 . 10 ) 



where req is the distance of the field line from the Earth’s center above the 
equator. This is also the largest distance of a field line; it is used to define 
the L-shell parameter Lq = req/i^E (see Fig. 8.2). The magnetic field then is 



B{Lo,^) 



Be \/l + 3 sin^ ^ 
Lq cos^ ^ 



( 8 . 11 ) 



and the equation of the field line can be written as L = I/qcos^^. The 
field line intersects the Earth’s surface at a latitude cos^e = l/\/^, where 
L = 1. Physically, the L-shell is the surface traced out by the guiding center 
of a trapped particle as it drifts around the Earth while oscillating between 




Fig. 8.2. Shape of magnetic dipole field lines and the definition of the L-shell 
parameter as the intersection between the field line and the equatorial plane 
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its northern and southern mirror points. Note that on a given L-shell the 
particle’s physical distance from the Earth’s surface might change while its 
‘magnetic distance’ stays constant. 

Multipole Expansion. A more correct expression for the geomagnetic field 
not too high above the Earth’s surface is a multipole expansion. In spherical 
coordinates r, 9 and (^, the potential V can be written as 

oo n 

= E ;3t E cosinup) + /i™ Sin(m(^)} P™(cos 6) . (8.12) 

n=l m=Q 

The and are normalization coefficients and the are the Legendre 
coefficients. For m > n, equals 0. The quantity n gives the order of 
the multipole: with n — 1, (8.12) describes a dipole field, with n = 2 a 
quadrupole. The magnetic monopole (n = 0) is not contained in (8.12). The 
potential of a dipole field can be inferred from (8.12) as E = (rs/r^)^? cos0. 

The coefficients in (8.12) are determined from fits to the measured mag- 
netic field. Because the field changes with time, the coefficients have to be 
adjusted too. The field determined with the multipole expansion is accurate 
to 0.5% close to the Earth’s surface. 

The Surface Field: Shift of the Dipole Axis. Fits on the field measured 
close to the Earth’s surface reveal an offset of the magnetic dipole relative to 
the center of the Earth by 436 km in the direction of the westerly Pacific. This 
offset leads to a region of unusually small magnetic flux density in the south 
Atlantic, just off the coast of Brazil, the South Atlantic Anomaly (SAA); see 
Fig. 8.3. Since the radiation belts are roughly symmetric around the dipole 
axis, they come closest to the Earth’s surface, too, making the SAA suitable 
for the study of radiation belts by rockets, but also making it a radiation risk 
for manned space- flight. 

Deviations from the Multipole: The Outer Field. With increasing 
height, the shifted multipole approximation becomes less efficient. The im- 
portance of the higher moments of the multipole decrease, but the field does 
not become dipole-like. Instead, it is distorted by the influence of electric 
currents in the ionosphere and magnetosphere and by the direct action of 
the solar wind. While the solar wind strongly modifies the structure of the 
outer magnetic field, only the influence of the ring current can be observed 
at the surface: the ring current results from the opposite drifts of electrons 
and protons in the radiation belts and gives rise to a magnetic field opposite 
to the Earth’s field, thereby reducing the magnetic flux density. 

8.1.2 Variability of the Internal Field 

With the beginning of systematic measurements of the terrestrial magnetic 
field in the middle of the nineteenth century, its variability became evident: 
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Fig. 8.3. International geomagnetic reference field. Contour intervals are 1000 nT. 
In a pure centered dipole field the iso-intensity lines would be horizontal. For the 
most recent list of coefficients see www . agu . org/eos_elec/000441e . html. Reprinted 
from lAGA [250], EOS 67, Copyright 1985, American Geophysical Union 



a systematic daily variation (Sq variation) was occasionally superimposed by 
much stronger variations, the geomagnetic disturbances. Later it was discov- 
ered that the terrestrial magnetic field also varies with the solar cycle. On 
geological time scales, variations become more pronounced, including the re- 
versal of the field. The origins of these variations are quite different: daily 
variations result from the asymmetric shape of the magnetosphere. Magnetic 
storms and variations with the solar cycle reflect the variability of the solar 
wind and solar activity; variations on even longer time scales are related to 
the MHD dynamo inside the Earth. 

Pole reversals give strong evidence for a variability of the internal field 
and the dynamo process inside the Earth. Although we have not witnessed 
such a reversal, the different sheath of lava at the deep-sea trenches have 
preserved a record of magnetic field polarity (see Fig. 8.4). It appears that 
the typical cycle for field reversal is about 500 000 years; however, shorter 
polarity reversals, also called magnetic events, can be observed on time scales 
between a few thousand years and about 200 000 years. The last polarity 
reversal occurred about 30 000 years ago, a time when the early humans 
already had spread across the Earth and the Neanderthals still where alive. 
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Fig. 8.4. Polarity rever- 
sals of the geomagnetic 
field during the last 
5 million years as derived 
from lava records. Four 
magnetic field epochs are 
indicated on the left, and 
reversals on shorter time 
scales, magnetic events, 
are indicated on the right. 
Reprinted from D. Gub- 
bins [205], Rev. Geophys. 
32, Copyright 1994, 
American Geophysical 
Union 



At the time of polarity reversal, fossil records often indicate the extinc- 
tion of different plant species [253]. Today it is not clear whether this points 
to a causal relationship. Four models are discussed, (a) The weak or absent 
magnetic field at the time of polarity reversal allows the cosmic radiation 
to penetrate down to the biosphere, causing increased radiation damage in 
certain plant species and leading to the extinction of at least some of them. 
Although tempting, this interpretation probably will not hold as most of the 
cosmic rays are absorbed by the atmosphere well above the biosphere. The 
field reversal can be seen as an increase in the records of cosmogenic nu- 
clides produced by the interaction between cosmic rays and the atmosphere, 
such as or ^^Be; however, its amplitude is too small to be considered 
a biological hazard, (b) Certain species in the micro fauna are sensitive to 
the magnetic field. Thus field reversals might have changed the biochemical 
processes within these micro-organisms, leading to their extinction, or might 
have pushed them from their original habitat into a life-threatening one. 
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Since they are the start of the food chain, the extinction of micro-organisms 
also can lead to the extinction of other species, (c) Within the framework of 
catastrophe theory, there is discussion as to whether an external event might 
have caused both the polarity reversal and the extinction of species, (d) Mag- 
netic field polarity reversals are often associated with climate changes [567]. 
Although the mechanisms are not yet understood, it appears possible that 
they invoke an interaction between galactic cosmic rays and the atmosphere, 
in particular changes in the global circulation patterns in the stratosphere 
or upper troposphere, in the ozone column, or in cloud cover. Another ex- 
planation for a link between pole reversals and climate change might be a 
modified circulation and heat transfer pattern inside the Earth’s core which 
might lead to both changes in the heat fiux through the Earth’s surface and 
a modified dynamo process. 

But even if the magnetic field has a certain polarity, it is not necessar- 
ily constant. Instead, variations in the dipole moment can be found (see 
Fig. 8.5). For instance, about 2000 years BP, the magnetic dipole was almost 
50% stronger, as can be seen from the dotted curve, while about 25 000 years 
ago, the magnetic field had only half of its present value. 

Figure 8.6 shows the variation of the magnetic moment (top) and the 
location of the geographic north since 1600. Measurements are sparse in the 
early part of that period but frequent since the time of Gauss’s analysis in 
1835. Since that time, the magnetic moment of the dipole has decreased by 
about 5% per century, while the location of the north magnetic pole has been 
roughly constant. Extending the time period back to 1600, the decrease in 
magnetic moment, although slightly weaker, is again prominent, while the 
drift of the magnetic north is more pronounced: about 0.08° per year to the 
west and about 0.01° per year to the south. 

0 5000 10000 15000 20000 25000 years BP 




Fig. 8.5. Dipole moment of the terrestrial magnetic field for the last 27 000 
years {dotted line) and the last 2600 million years {solid line). Reprinted from K. 
Strohbach [507], Unser Planet Erde, Copyright 1991, with kind permission from 
Gebriider Borntraeger Verlag 
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Fig. 8.6. Variation of the magnetic dipole moment (top) and the position of the 
geomagnetic north pole (bottom) since 1600. Figure from A.C. Fraser-Smith [170], 
Rev. Geophys. 25, Copyright 1987, American Geophysical Union 



Information about recent work on geomagnetic variations and its ter- 
restrial consequences as well as useful links on this topic, can be found at 
www.tu-bs.de/institute/geophysik/spp/index_en.html. The identifica- 
tion of changes in the terrestrial magnetic field is only possible if accurate 
measurements are available. While, historically, ground-based observatories 
have been used, satellite measurements have the advantage of a global cov- 
erage. CHAMP (op.gfz-potsdam.de/champ/index_CHAMP.html) is one ex- 
ample of such a satellite. 

Thus not only the times of polarity reversal but also the strength of the 
geomagnetic dipole are distributed stochastically. For instance, in the pe- 
riod between 118 million and 83 million years BP, no magnetic field reversal 
occurred. For the last 5 million years, the statistical distribution of field re- 
versals can be described by an asymmetric random walk distribution [354]. 
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Such distributions are found if the observed process, here the polarity rever- 
sal, results from a large number of small individual events following a bell 
shape distribution. In the case of the terrestrial magnetic field, these small 
events probably are the patterns of the convection cells in the outer core 
which play an important role in the dynamo process. 



8.1.3 The Terrestrial Dynamo 

The principle of a MHD dynamo has already been discussed in Sect. 3.6. 
The special topology of the terrestrial dynamo is shown in Fig. 8.7. Both 
panels show the Earth’s core only, the combined processes give a complete 
description of the terrestrial dynamo. 

The motion of matter is depicted in the left panel: the liquid inside the 
outer core rotates but the angular speed decreases with increasing distance 
from the Earth’s center. This differential rotation is the consequence of the 
vertical transport of angular momentum: convection transports matter up- 
wards from the deeper layers of the outer core, where the linear speed is 
small, while matter from the higher layers, where linear speeds are high, is 
transported downwards. As a result, the inner part of the outer core rotates 
faster than its outer one. In Fig. 8.7, this difference is indicated as the speed 
v\ of the deeper layers relative to the outer layers. Let us now assume a mag- 
netic seed field B\ which is at rest relative to the outer layers and parallel to 
the axis of rotation. In the inner layers, the relative motion of the fluid with 
respect to the magnetic field causes an electric induction field E\ directed 
towards the axis of rotation. Since the matter inside the core is highly con- 

North 





Fig. 8.7. Origin of the terrestrial magnetic field in a dynamo process. Only the 
core of the Earth is shown. The processes sketched in both panels have to be viewed 
together. Reprinted from K. Strohbach [507], Unser Planet Erde^ Copyright 1991, 
with kind permission from Gebriider Borntraeger Verlag 
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ductive, a ring current j\ results which is directed counter-clockwise in the 
northern hemisphere and gives rise to a toroidal magnetic field B 2 . In the 
southern hemisphere, the current ji and the magnetic field B 2 have opposite 
directions. 

In addition, in the outer core convection takes place in the Taylor columns, 
that are rotating vertical columns. The toroidal magnetic field B 2 is pushed 
upward by the convective motion and simultaneously twisted by the Coriolis 
force, as indicated in the small loop in the lower left of the right panel in 
Fig. 8.7. The resulting magnetic field Bs is poloidal and adds to the initial 
field: the small seed field is amplified. 

This dynamo process can explain the basic features of the terrestrial mag- 
netic field. Many details, in particular the details of the polarity reversal, are 
not yet understood. One important ingredient for the terrestrial dynamo is 
the core’s differential rotation, leading to the poloidal field; the other is the 
helical twist of the field lines. Thus our description of the geomagnetic dy- 
namo is, as in the case of the solar dynamo, in terms of an aQ process. If the 
differential rotation were absent, no amplification of the seed field would be 
possible. This appears to be the case in the extremely slowly rotating Venus 
and the deep-frozen Mars. 



8.2 Topology of the Magnetosphere 

Now let us put the terrestrial dipole field into a magnetized plasma, the solar 
wind. How will the dipole be distorted by this flow? Which topology of the 
magnetosphere arises? How deep into the magnetic field does the influence 
of the solar wind extend? 

8.2.1 Overview 

The structure of the magnetosphere is best described in a frame of reference 
with a fixed Sun-Earth axis. The magnetosphere than stays fixed in space 
while the Earth rotates inside it. This system divides the magnetosphere 
into two parts, a dayside directed towards the Sun and a nightside facing 
the magnetotail. The corresponding directions in the equatorial plane are 
noon and night, and the direction perpendicular to it dusk and dawn, always 
referring to local time. 

The most important features of the magnetosphere are shown in the noon- 
midnight cross-section in Fig. 8.8. Typical extensions are about 10 te in 
the solar direction and more than hundred te tailwards. The bowshock, the 
magnetosheath and magnetopause, the cusps and the tail are indicated. These 
components are strongly determined by the interaction between the terrestrial 
magnetic field and the solar wind plasma. However, Fig. 8.8 also indicates that 
the magnetosphere is far from being homogeneous but is highly structured 
by different plasma and particle components. 
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Fig. 8.8. Noon-midnight cross-section through the magnetosphere. Reprinted from 
G.K. Parks [397], Physics of Space Plasmas, Copyright 1991, with kind permission 
from Addison- Wesley Publishing Company 

Starting from the surface of the Earth, on the way to interplanetary space, 
an astronaut will encounter different regimes of magnetospheric plasma. Rel- 
atively low in the atmosphere, at a height of about 70 km, the ionosphere 
begins. As a conductive layer, it forms the bottom of the magnetosphere. 
Above it, the plasmasphere, dominated by ionospheric plasma, extends up 
to a few Earth radii. The radiation belts are embedded in the plasmasphere. 
The plasmasphere still is relatively symmetric, except for a small bulge on 
the nightside. The overlying geosphere is more strongly influenced by the 
interaction between the geomagnetic field and the solar wind: it is highly 
asymmetric with a larger extension towards the tail. In addition, it is highly 
variable, at times being dominated by solar wind plasma while at other times 
ionospheric plasma is more abundant. The outer magnetosphere, the region 
between the geosphere and the magnetopause, is filled by a plasma of solar 
wind origin although the magnetic field still is the geomagnetic one. 



8.2.2 The Magnetopause 

Let us first have a look at a boundary between the solar wind regime and the 
terrestrial one. This boundary is called the magnetopause and is defined as an 
equilibrium between the solar wind kinetic pressure and the pressure of the 
terrestrial magnetic field. The interplanetary magnetic field and the thermal 
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pressure of the solar wind do not contribute significantly to this balance; 
their combined pressure is less than 1% of the plasma kinetic pressure. For a 
similar reason, the gas-dynamic pressure inside the magnetosphere does not 
enter into the balance: it is too small compared with the magnetic pressure. 

In the pressure balance we have to consider the geometry of the field 
and the plasma flow. Since the magnetic pressure is anisotropic, only the 
tangential magnetic field contributes to the magnetic pressure: 



- ML 

“ 2mo 



(8.13) 



If we assume the magnetopause to be a perfect boundary between the solar 
wind and the terrestrial field, equals zero at the magnetopause. If we de- 
scribe the solar wind as an electron and ion plasma flowing at an angle xjj with 
respect to the normal direction on the magnetopause, within a second each 
surface element of the magnetopause is hit by ngowit^sowi cos'^ particles, with 
^sowi being the number density and t^sowi the bulk speed of the solar wind. 
These particles transfer a momentum 2mnsowit/sowi ^ ~ 

The pressure balance at the magnetopause therefore can be written as 






K 



(8.14) 



Note that this is a simplification because we have not considered a slow-down 
of the solar wind as it passes through the bow shock and part of the flow 
energy is converted into thermal energy; see Sect. 8.2.5. Nonetheless, (8.14) 
is still valid as long as the factor 2 in the kinetic pressure is substituted by a 
factor K <2, which at the terrestrial bow shock is about 0.88. 

Let us assume the magnetosphere to be axisymmetric. In cylinder coor- 
dinates r is the radial distance from the axis of symmetry and s the distance 
along the magnetopause, measured from the subsolar point. With dr/ds = 
cos'0 we can determine the cosine in (8.14). If Bt were known, the position of 
the magnetopause could be determined. But Bt is known only as the solution 
of a potential problem with the boundary conditions defined by (8.13). Thus 
(8.14) has to be solved iteratively: we make an assumption for Bt which is 
a solution of (8.13). Then we can determine the electric currents inside the 
magnetopause, which in turn give a new Bt, which again can be used as input. 

As a crude measure for the size of the magnetosphere, the position of 
its subsolar point on the Sun-Earth line is used. Here the plasma flow is 
perpendicular to the magnetopause and cos'i/j equals 1. Bt can be approx- 
imated from Bo, the magnetic field at the Earth’s surface. If we assume a 
mirror dipole at a distance 2d, the normal component of the magnetic field 
would vanish if the tangential field were doubled in d: Bt = 2Bojd^. Then 
the distance of the subsolar point can be determined from (8.14): 
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For typical solar wind conditions, the subsolar point, or stand-off distance, is 
at lOrE- 

As can be seen from (8.15), the stand-off distance depends on the solar 
wind speed, and to a lesser extent, also on its density. The variability of 
the solar wind conditions thus leads to continuous changes in the size and 
therefore also in the shape of the magnetopause. Depending on the solar 
wind speed, the subsolar point is between 4.5 and 20 te- Since solar wind 
speed changes can be quite abrupt, e.g. across a travelling interplanetary 
shock or when a fast stream is suddenly swept across Earth, the magne- 
topause has to adjust to this changed environment rather fast. Satellite ob- 
servations indicate speeds of the magnetopause between a few kilometers per 
second in response to solar wind fluctuations and up to about 600 km/s in 
response to discontinuities. The average speed of the magnetopause is about 
40 km/s. 

Although the magnetopause is defined as a three-dimensional boundary 
at which an equilibrium between the solar wind and the planetary magnetic 
field is established, it is not infinitely thin. Instead, it is an extended sheath 
with a thickness between a few hundred up to thousand kilometers: the so- 
lar wind is reflected at the magnetopause only after it has penetrated into 
the magnetic field and has been turned around by the Lorentz force (see 
Fig. 8.9). Since the Lorentz force depends on the charge, electrons and ions 
are deflected in opposite directions, forming the Chapman-Ferraro current 
inside the magnetopause. This charge separation leads to a pile-up of charges 
at the flanks of the low-latitude magnetosphere (low-latitude boundary layer 
(LLBL)) with an excess of positive charges on the dawn side and negative 
charges on the dusk side. This can also be interpreted as a dawn-to-dusk elec- 
tric field. Field lines intersecting the LLBL map back this potential pattern 
towards the high-latitude ionosphere. Relevant aspects concerning the LLBL 
are summarized in a series of articles in [377]. 



Fig. 8.9. The de- 
flection of solar wind 
electrons and protons 
is responsible for both 
the finite thickness of 
the magnetopause and 
the Chapman-Ferraro 
current inside it. 
Based on [563] 
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This deflection can also be described as an extreme case of the grad B 
drift, with the magnetic fleld vanishing on one side of the boundary. The 
drift speed, as given by (2.54), only depends on the particle’s Larmor ra- 
dius which also deflnes the depth at which the particles penetrate into the 
magnetopause. Since the Larmor radius depends on m/\q\, ions penetrate 
deeper into the magnetopause than electrons, leading to an excess of nega- 
tive charges in the outer magnetopause and an excess of ions in the inner one. 
The resulting electrical polarization fleld accelerates (decelerates) an electron 
(ion) on entering the magnetopause and decelerates (accelerates) the particle 
on leaving it. Thus the particles do not travel along semicircles, but along 
elliptical orbits as sketched in Fig. 8.9. 

The physics of the magnetopause is discussed in details in a series of 
articles in [497]. 

8.2.3 Polar Cusps 

The polar cusps are two singularities in the dayside magnetosphere: here the 
magnetic fleld vanishes and particles and plasma can penetrate freely into 
the magnetosphere. The polar cusps separate closed fleld lines on the dayside 
magnetosphere from open fleld lines swept to its night side. The cusps are 
not located at the dipole axis but at lower geomagnetic latitudes because 
the higher latitude field, which in the dipole fleld still would close on the 
dayside, are convected with the solar wind to the night side magnetosphere. 
The magnetic fleld lines connect the cusps back to geomagnetic latitudes of 
about 78°; they are the only ones connecting the surface of the Earth to the 
magnetopause. Thus all fleld lines of the magnetopause converge at the cusps. 
The cusps themselves are fllled with plasma from the magnetosheath but not 
from the magnetosphere. Thus at the cusps, plasma of solar wind origin can 
penetrate deep into the Earth’s atmosphere, as can energetic particles. 



8.2.4 The Tail and the Polar Caps 

Magnetic fleld lines extending from the cusps to the nightside form the bound- 
ary of the magnetotail. Close to the Earth, in addition to these open fleld 
lines also closed fleld lines can be found inside the tail, preserving, at least 
partly, the dipole character of the inner magnetosphere. As on the dayside 
magnetosphere, the closed fleld lines originate in geomagnetic latitudes below 
78°. At higher latitudes, all fleld lines are open and swept into the night side. 
This region is called the polar cap. 

Figure 8.10 shows a sketch of the magnetosphere, drawn to scale to visu- 
alize the extent of the magnetotail. The solid lines give the fleld lines, and the 
dashed lines are the trajectories of plasma particles, which will be discussed 
in Sect. 8.3. Two important features are obvious: first, plasma is convected 
from the plasma mantle at the polar cusps towards the plasma sheet in the 
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Fig. 8.10. Sketch of the magnetosphere and the magnetotail, drawn to scale. The 
solid lines are magnetic field lines, and the dashed lines give the trajectories of 
plasma parcels, filling the plasma sheet from the mantle. X-points inside the plasma 
sheet are favorable positions for reconnection. Reprinted from W.G. Pilip and G. 
Morfill [411], J. Geophys. Res. 83, Copyright 1978, American Geophysical Union 



magnetotail. Thus a continuous flow of hot solar wind plasma Alls part of the 
magnetosphere. Second, in the equatorial plane there is a plasma sheet sep- 
arating the oppositely directed magnetic flelds of the north and south lobes 
(flelds directed toward and away from the Earth). Inside this plasma sheet, 
neutral point conflgurations suitable for reconnection can form. One example 
is indicated as the X-point. Thus the magnetotail obviously has dynamic as- 
pects: it is Ailed with plasma from the outside and occasionally reconnection 
in the neutral sheet leads to an ejection of plasma towards the Earth, where 
it can create beautiful displays of aurora. 

This magnetotail conflguration requires currents: the Chapman-Ferraro 
current in the magnetopause, and the tail current inside the plasma sheet 
separating the north and south lobes. Both currents are perpendicular to the 
magnetic held lines and form the closed current system sketched in Fig. 8.11. 

The radius of the magnetotail can be estimated by a simple approxima- 
tion. All held lines of the tail connect back to the polar caps of the corre- 
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Fig. 8.11. Cross-section of the magnetotail 
with the currents inside the magnetopause 
and in the neutral sheet forming a closed 
loop 
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spending hemisphere. The magnetic flux leaving the polar caps is defined by 
the vertical component of the magnetic field integrated over the polar caps: 

^pc — 27r(rE cos0Pc)^Bo • (8.16) 



Here 6pc is the latitude of the boundary of the polar cap and Bq is the 
equatorial magnetic field strength, which is about half the flux density at the 
polar cap. This flux is convected outwards into the magnetotail. If we assume 
the lobe to be semicircular with radius rxaii and magnetic field strength HTaii, 
the flux inside one lobe is ^xaii = 'TrrTaii^Taii/2. With (8.16) we get 
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(8.17) 



With 6pc = 75° and Bq = 31 000 nT, the radius of the magnetotail is 20 rp 
for jBxaii = 20 nT or 29 rp for Hxaii = 10 nT. The latter value is typical for 
the outer magnetosphere. 

The current density inside the magnetopause can be determined from the 
momentum balance at the magnetopause: Vpsowi — J x B/c. Alternatively, 
the cross-tail current density can also be determined from Ampere’s law to 
he 6B = 25xaii = 47 tj7c, with 6B being the jump in magnetic field strength 
across the current sheet. With Hxaii = 20 nT we find j = 30 mA/m for the 
cross-tail current and half this value for the magnetopause current. 

How far does the tail extend? Do the tail field lines close far away from 
the Earth or do they connect, at least partly, to the interplanetary mag- 
netic field? Figure 8.12 sketches such an open magnetosphere: close to the 
Earth, three types of magnetic field lines can be observed: (i) an interplane- 
tary magnetic field line of solar origin passing by; (ii) closed dipole field lines 
of planetary origin; and (iii) a merged planetary and interplanetary mag- 
netic field line which connects the surface of the Earth magnetically to the 
interplanetary medium. The dashed lines give neutral sheets where the inter- 
planetary magnetic field vanishes. A careful discussion of the many aspects 
of the magnetotail is given in [378]. 

For orientation. Fig. 8.13 gives the volumes of the magnetosphere predom- 
inantly occupied by open (top) and closed (bottom) field lines. The volume 
with the open field lines contains the tail lobes, including the mantle, the 




Fig. 8.12. Sketch of possible 
connections of the magneto- 
spheric field to the interplan- 
etary medium for two differ- 
ent polarities. Based on K.A. 
Anderson and R.B. Lin [6], J. 
Geophys. Res. 74 , Copyright 
1969, American Geophysical 
Union 
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Fig. 8.13. Volumes of the magnetosphere 
occupied by open (top) and closed (bottom) 
field lines, adapted from N. Crooker [114], J. 
Geophys. Res. 82, Copyright 1977, American 
Geophysical Union 



cusps, and the open portions of the low-latitude boundary layer on the day- 
side magnetosphere. The volume containing the closed field lines includes the 
plasma sheet, the quasi-dipolar inner magnetosphere, and the closed part of 
the low-latitude boundary layer. The shaded area indicates the part of the 
magnetosphere that abuts closed field lines. The figure is drawn as symmetric 
with respect to the equatorial plane and to noon. In reality, these symmetries 
are broken by the tilt of the geomagnetic dipole with respect to the plane of 
the ecliptic and the direction of the interplanetary magnetic field. 



8.2.5 Magnetosheath and Bow Shock 

A prominent feature in front of the magnetopause is the bow shock where 
the supersonic solar wind is slowed down to subsonic speed. The bow shock 
is about 2 to 3 rE ahead of the magnetopause, and its upstream medium is 
characterized by turbulence and energetic particles; see Sect. 7.6.4. 

The solar wind flow passes through the bow shock but does not penetrate 
the magnetopause. Thus the position of the bow shock must be adjusted so 
as to allow the solar wind to flow around the obstacle magnetopause. At the 
subsolar point, observational evidence suggests that the ratio between the 
position of the bow shock and the stand-off distance of the magnetopause is 
l.ln, with n being the density jump at the bow shock. If we assume a gas- 
dynamic shock, the density jump depends on the Mach number M and on 7ad 
as n = [(7ad — 1)M^ -f 2]/ (7ad + 1)M. With M = S and 7ad = 5/3, the bow 
shock is 29% farther out than the stand-off distance of the magnetopause. 

In particular, close to the subsolar point the observations are in quite good 
agreement with these theoretical predictions, as can be seen in Fig. 8.14. Here 
the positions of the bow shock and magnetopause in the equatorial plane 
are shown. The solid lines give the calculated magnetopause and bow shock 
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Fig. 8.14. Observed and calculated positions of the bow shock and the magne- 
topause in the equatorial plane; the scatter in the symbols indicates the variability 
of the magnetopause and bow shock due to solar wind variations. Reprinted from 
N.F. Ness et al. [376], J. Geophys. Res. 69, Copyright 1964, American Geophysical 
Union 



positions for average solar wind conditions, and the symbols give observed 
distances. Their scatter reflects the variability of the solar wind. 

In the magnetosheath, the region between the bow shock and the mag- 
netopause, the solar wind plasma is deflected and slowed down. Kinetic flow 
energy is converted into thermal energy, heating the plasma to about 5 to 10 
times the solar wind temperature. Spreiter et al. [499] used a gas-dynamic 
model to describe the plasma flow inside the magnetosheath. The magnetic 
held is considered only in so far as it is convected by the solar wind; however, 
it does not modify the dynamics of the process. Two sample solutions for the 
geometry symmetric around the Sun-Earth line are shown in Fig. 8.15. In 
the left panel, the stream lines give the deflection of the plasma flow around 
the magnetopause and, as the magnetic field is convected with the plasma. 




Fig. 8.15. Plasma flow inside the magnetosheath. (Left) Streamlines, which are 
also magnetic field lines, are shown. The dashed curve marks the transition from 
supersonic to subsonic flow. {Right) Velocity and temperature ratios. Based on [499] 
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also the magnetic field configuration in the magnetosheath. The dotted line 
marks the transition from supersonic to subsonic fiow. The right panel shows 
the velocity and temperature ratios. Their contours are identical, as can be 
seen from integration of the energy equation [499] which gives 



T ^ (7ad - ml 
Too 2 



1 









(8.18) 



The increase in plasma temperature can be quite substantial: on the dayside, 
the solar wind temperature can increase by up to a factor of 20. Since it is 
the sum of both electron temperature and ion temperature, with the electron 
temperature often about twice as high as the ion temperature, and since 
the electron temperature does not change significantly, the increase in ion 
temperature can be much larger than indicated in Fig. 8.15. 



8.3 Plasmas and Currents in the Magnetosphere 

We will now follow the path of our hypothetical astronaut from Sect. 8.2.1 
in more detail. However, to get the broader scope, we shall start with some 
basics of atmospheres. 

8.3.1 The Atmosphere 

The solar electromagnetic radiation determines the structure and dynamics 
of the atmosphere. Depending on the radiation’s wavelength, it interacts with 
the atmosphere at different altitudes. Since interaction always is associated 
with heating, a characteristic temperature profile develops which can be used 
to define atmospheric layers (see Fig. 8.16). 

The bottom layer of the atmosphere, the troposphere, has a thickness be- 
tween about 16 km at the equator and less than 10 km close to the poles. This 
layer contains more than three-quarters of the atmospheric mass and there- 
fore, energywise, is the most important layer. Its composition is basically 78% 
N 2 , 21% O 2 , and a number of trace gases, as well as up to 4% water vapor. 
The presence of water vapor allows the formation of clouds and precipitation, 
thus the troposphere is the weather layer of our planet. The combined effects 
of radiation, convection, and the transport of latent heat cause a negative 
temperature gradient of about 6.5 K/km. Out of the incident solar radiation, 
only the visible and infrared penetrate down to the troposphere; the shorter 
wavelengths are absorbed at higher altitudes. The top of the troposphere is 
the tropopause, the local minimum in the temperature profile. 

The next layer, the stratosphere, is characterized by a positive tempera- 
ture gradient caused by the absorption of UV in the ozone layer. Its shape 
stems from the same combination of effects that is responsible for the forma- 
tion of the ionospheric Chapman layers (Sect. 8.3.2): the incoming electro- 
magnetic radiation increases with height, while the density decreases. Thus 
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Fig. 8.16. Horizontal structure of the terrestrial atmosphere with different lay- 
ers defined by extrema in the temperature profile, mixing of the components, the 
possibility of particle escape, and the ionization 



at a certain height, a maximum in the ionization is established: below this 
height, the intensity of the electromagnetic radiation is too small for efficient 
ionization, and above this height there are not enough particles left to ion- 
ize. The maximum of the ozone layer is at a height of about 25 km. The 
stratosphere is dry; its water vapor content is almost negligible, although 
water vapor plays an important role in the ozone chemistry (Sect. 10.4.2). 
Since the tropopause is a temperature inversion, ideally there would be no 
exchange of matter across it. Exchange, however, happens, as is evident, for 
instance, from the influence of the anthropogenic CFCs on the ozone layer 
and from the deposition of cosmogenic nuclides formed in the stratosphere. 
Two effects allow such an exchange: violent processes, such as nuclear explo- 
sions or erupting volcanoes, and a seasonal slow exchange at the jet streams, 
where the tropopause is “leaky” . The consequences of this slow exchange are 
twofold: luckily, it is rather difficult for man-made gases to reach the strato- 
sphere, but unfortunately, once such substances have entered it, they will 
be removed only slowly. The stratosphere extends up to altitudes of about 
40-50 km. 

In the mesosphere, the temperature gradient is negative again. The meso- 
sphere extends up to a height of about 80 km; it is characterized by pho- 
todissociation, ionization (the D-layer of the ionosphere is inside the meso- 
sphere), and a variety of chemical processes. One prominent feature of the 
mesosphere is noctilucent clouds [178,287]: for examples see www.meteo. 
helsinki . f i/"'tpnousia/nlcgal/nlcgal . html, www . nlcnet . co . uk/, www . 
polarimage.fi, or www.iap-kborn.de/optik/nlc/nlc_kb_d.htm. 

Above the mesosphere, the temperature increases again because the hard 
electromagnetic radiation is absorbed, leading to the formation of the main 
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ionospheric layers. Thus the lower thermosphere is characterized by a posi- 
tive temperature gradient. Above an altitude of 150-200 km the thermosphere 
is isothermal, with temperatures between about 1300 K (nightside at solar 
minimum) and 2000 K (dayside at solar maximum). It is characterized by ex- 
tended circulation systems which vary seasonally and with the solar cycle. It 
also is the atmospheric layer that is connected directly to the magnetospheric 
processes: the currents providing the ionosphere-magnetosphere coupling run 
through the thermosphere, and the particles causing aurorae penetrate down 
to its bottom. 



8.3.2 The Ionosphere 

The ionosphere starts in a height of about 70 km as a charged-particle com- 
ponent inside the atmosphere. The ionosphere often is described as the base 
of the magnetosphere. Because of its high density and the existence of a large 
neutral component it does not obey the definition of a magnetosphere, namely 
that particle motion is determined by the magnetic field only. Nonetheless, 
it is vital for the understanding of the magnetosphere because it provides a 
highly conducting bottom layer and ionosphere-magnetosphere coupling is 
important for the energetics of the magnetosphere. 

Chapman Layers. The ionosphere is formed due to the ionization of atmo- 
spheric constituents by hard electromagnetic radiation in the UV and EUV 
range. Conveniently it is described as consisting of different layers. To de- 
rive the height profile of such a layer, the Chapman profile, we can use a 
simplified model considering one atomic species and monochromatic electro- 
magnetic radiation only. The variation of density n with height z is described 
by the barometric height formula 

n{z) = noexp{—z/H} , (8.19) 

where H = k^T j mg is the scale height. Thus the intensity decreases exponen- 
tially with increasing height. The intensity 1 of the ionizing electromagnetic 
radiation, on the other hand, increases with increasing height: it is maximal 
at the top of the atmosphere and then is absorbed according to Bougert- 
Lambert-Beer’s law 

dl/Az = —looCTaJl , ( 8 . 20 ) 

where da is the absorption cross-section for the particle species and frequency 
range under study. The intensity at height z then is given by 



I{z) = loo exp 




( 8 . 21 ) 




8.3 Plasmas and Currents in the Magnetosphere 303 




Fig. 8.17. The Chapman profile of an iono- 
spheric layer results from the superimposition 
of the height dependences of the particle den- 
sity and the flux of the ionizing electromag- 
netic radiation 



where 6 is the Sun’s altitude and 
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( 8 . 22 ) 



the optical depth. 

The electromagnetic radiation leads to a height-dependent ionization rate 

q{z) = na[I{z) (8.23) 

where a[ is the ionization cross-section. It is cTa > di because absorption not 
necessarily leads to ionization. 

The combination of the two profiles gives the Chapman profile (see 
Fig. 8.17): at a certain height, the ionization, and therefore also the charge 
density, is highest. Below, it decreases as the intensity of the ionizing radi- 
ation decreases. At higher altitudes, although the intensity of the ionizing 
radiation is higher, the charge density decreases, too, because the density of 
particles available for ionization is lower. Formally, we can insert (8.21) and 
(8.19) into (8.23) and get the charge density in the Chapman layer; 

q{z) = cTino/oo exp { ^ ~ • (^-24) 

t cos \J Jtl y 

If the Sun is in the zenith, the charge density is largest and the maximum of 
the Chapman layer is at lower altitudes. With decreasing solar altitude, the 
Chapman layer shrinks and its maximum shifts to higher altitudes. 

Equation (8.24) gives the ionization rate. It is thus a good approximation 
for the number of electrons created at a certain height. The dynamics of an 
ionospheric layer, however, are not only determined by the ionization but 
also by losses due to recombination and attachment to neutrals. These loss 
processes modify the daily variation of the electron density. 

Since the atmosphere consists of different particle species and the incom- 
ing radiation covers a broad spectrum, for each particle species such a layer 
forms in the ionosphere at its typical height. Figure 8.18 summarizes these 
layers: on the right, the densities of the neutrals are shown, and on the left. 
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Fig. 8 . 18 . Height dependence of different constituents of the ionosphere and at- 
mosphere for quiet solar conditions. Based on [255] 



the ionospheric layers are depicted for the different ion species as well as 
for the electrons. The electron distribution is the sum of the different ion 
layers. Note that with increasing height the relative importance of the ion- 
ized component increases: while at a height of about 100 km only 10~^ of 
the atoms and molecules are ionized, above a height of about 800 km only 
ionized particles exist. 

Since these layers have a large extension in height and partly overlap, 
they are not easily identified in observations. Instead, from the observations 
a more simple scheme for layering in the ionosphere has emerged early in 
ionospheric studies. The earliest detected layer was the E-layer, named so 
due to the reflection of electric fields. It is the best studied layer and is 
dominated by and NO“^. Here we find about one electron for every 10^ 
neutral particles. Below the E-region, between 60 and 90 km, is the D-region. 
It is highly variable with a much smaller electron content. Above the E-region 
is the F-region, which also contains the maximum in electron concentration. 
This maximum is typically given at altitudes around 300 km, however, it can 
shift with solar activity between 200 km and 800 km. 

Sudden Ionospheric Disturbances SID. The profiles of the different 
Chapman layers in Fig. 8.18 are shown for quiet solar conditions. Thus the 
hard electromagnetic radiation is at a rather low level and no additional ra- 
diation is emitted in flares. During solar maximum, the intensity of the hard 
electromagnetic radiation is higher, leading to a stronger ionization. In addi- 
tion, during solar flares the hard electromagnetic radiation can be enhanced 
even further. Then the charge density in the ionosphere can become large 
enough to absorb electromagnetic waves instead of reflecting them, leading 
to a break-down in long-wave communication. Such an event is called a sud- 
den ionospheric disturbance (SID). 




8.3 Plasmas and Currents in the Magnetosphere 305 



Ionospheric Conductivity. Currents and plasma flows couple the iono- 
sphere and the magnetosphere. The ionosphere differs from the magneto- 
sphere in so far as collisions of charged particles with the neutrals of the 
atmosphere occur frequently; the ionosphere therefore is characterized by a 
collision-dominated plasma. Thus the conductivity is flnite and the frozen-in 
approximation is no longer valid. The conductivity is not only finite but also 
highly anisotropic. Three typical conductivities can be defined. The field- 
aligned conductivity parallel to B depends on the masses ttIq and mi of the 
electrons and ions and on their collision frequencies Uq and 
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(8.25) 



This expression corresponds to the ordinary conductivity. 

The Pederson conductivity is concerned with currents parallel to the elec- 
tric field. With oji as the gyro- frequencies we have 
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Pederson currents dissipate energy since E • j >0. The Hall conductivity is 
concerned with currents perpendicular to both the electric and the magnetic 
fields. It is free of dissipation and can be written as 



C^Hall = 
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The total current in the ionosphere therefore can be written as 

^ ^ B X E ^ 

J = Cr\\E\\ -h (JpedE -f CTHall ^ = CF E , 



where 
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(8.27) 

(8.28) 
(8.29) 



is the conductivity tensor. 

Of these conductivities, the field-aligned one generally is the largest. To 
maintain current continuity, the electric field component E\\ parallel to the 
magnetic field has to be very small. In particular, at high latitudes the mag- 
netic field is almost perpendicular to an ionospheric layer and therefore allows 
for an efficient current (Birkeland current) between the lower ionosphere and 
higher altitudes, the basis for magnetosphere-ionosphere coupling. 

The Ionospheric Current System. Conductivities are highest in the E- 
region, where also strong winds and tidal oscillations can be observed. Owing 
to their different masses, ions and electrons are influenced differently by these 
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Fig. 8.19. Average Sq cur- 
rents in the ionosphere 



motions of the neutral atmosphere: while the ions are forced to move across 
the field lines, electrons tend to move perpendicular to both the magnetic 
field and the neutral wind, albeit at a slower pace. This relative motion causes 
a charge separation and thus an electric field which, in turn, can affect the 
currents. Owing to the creation of an electric field by the atmospheric motion, 
the E-layer also is called the dynamo layer. 

The relation between the conductivity and the electric field is described 
by Ohm’s law. However, here we have to add a term which considers the 
motion imparted by the neutral wind 

j = (7 (JE + t;n X B) (8.30) 

with Vn being the velocity of the neutral wind. At low latitudes, the dynamo 
current is mainly driven by the x B field arising from the ion motion 
across the ^-field while at higher latitudes the contribution from the neutral 
wind is small and the main driving force is the electric field. 

In mid-latitudes, the driving force mainly is provided by atmospheric tides 
excited by solar heating of the atmosphere. The resulting current system is 
called the solar quiet or sq-current system. These currents are responsible for 
the daily sq- variations in magnetometer records. Figure 8.19 gives a sketch 
of the basic features of the current system viewed from above the ionosphere. 
Basic features are the two vortices, one in each hemisphere. These systems 
touch at the equator where they form a strong, jet-like current, the equatorial 
electrojet. This current is larger than just the sum of the two currents in the 
vortices because the special geometry of the magnetic field (almost horizontal) 
and the nearly perpendicular incidence of the solar electromagnetic radiation 
increase the conductivity. 

8.3.3 Magnetosphere-Ionosphere Coupling 

The motion of particles, plasmas, and magnetic fields gives rise to currents. 
Currents in the magnetosphere associated with its large-scale structure are 
the Chapman-Ferraro current inside the magnetopause and the tail current 
separating the southern and northern lobes. These currents do not affect 
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Fig. 8.20. Magnetosphere-ionosphere cou- 
pling as a closed circuit 

the terrestrial magnetic field, and so their fiuct nations are not recorded 
by ground-based magnetometers. In addition, the drift of charged particles 
trapped in the radiation belts gives rise to a ring current. These currents are 
perpendicular to the magnetic field. Another kind of current fiows parallel 
to the magnetic field and therefore can provide coupling between the iono- 
sphere and the magnetosphere. Since the magnetic field is perpendicular to 
the ionospheric layer only at high latitudes, these field-parallel currents are 
a phenomenon typical of the polar ionosphere and magnetosphere. 

The entire configuration then can be interpreted as a circuit with the solar 
wind-magnetosphere interaction working as a dynamo {E • j > 0) and the 
ionosphere being a load with dissipative losses {E - j < 0). The circuit then 
is closed by currents parallel to B; see Fig. 8.20. The driving force, and thus 
also the source of energy, is the solar wind. Thus the dynamo has correctly 
been called the solar wind dynamo. 

Birkeland Currents. The field-parallel currents can be observed in the 
polar ionosphere where the magnetic field lines are almost perpendicular. 
The average patterns of these Birkeland currents are shown in Fig. 8.21. 
The currents are plotted versus geomagnetic longitude with the dark areas 
indicating currents into the ionosphere and the lighter shaded areas indicating 
currents out of it. At high latitudes, currents are fiowing out of the ionosphere 
in the evening side and into it at the morning side. They are called region 
1 currents. At somewhat lower latitudes, the region 2 currents show the 
opposite pattern: they fiow into the ionosphere in the evening side and out of 
it in the morning side. At very high latitudes around noon, i.e. below the polar 
cusps, the pattern of the field-parallel current is highly variable and strongly 
depends on the northwards or southwards component of the interplanetary 
magnetic field owing to the convection of magnetic field lines across the polar 
caps (see Sect. 8.4). Around midnight, the currents in regions 1 and 2 overlap 
without a clear separation. 

Most of the field-parallel currents are carried by the electrons. Thus an 
inward current implies an outward motion of electrons and vice versa. The 
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Fig. 8.21. Distribu- 
tion of the field-parallel 
Birkeland currents in 
the polar ionosphere. 
Note that the numbers 
refer to the geomagnetic 
latitude. Reprinted from 
T. lijima and T.A. Po- 
temra [249], J. Geophys. 
Res. 81, Copyright 1976, 
American Geophysical 
Union 



pattern of Birkeland currents also reflects the spatial distribution of aurorae 
at geomagnetic quiet periods: aurorae are observed where electrons stream 
down to the ionosphere, i.e. where the Birkeland current is directed upwards. 

The upward and downward Birkeland currents are closed by ionospheric 
currents. The auroral zone electric field in the ionosphere is directed 
northwards in the dusk sector and southwards in the dawn sector, i.e. from 
dusk to dawn. Since the conductivity of the ionosphere is finite, the Birkeland 
currents will be closed in the ionosphere by field-parallel currents (Pederson 
currents) northwards in the dusk sector and southwards in the dawn sector. 
A possible closure of the circuit is sketched in Fig. 8.22. Here a dusk-to-dawn 
cross-section of the magnetosphere is shown viewed from the tail towards 




Fig. 8.22. Possible 
scenario for the clo- 
sure of the ionosphere- 
magnetosphere-current 
system in a cross-section 
in the dusk-to-dawn plane 
viewed from the tail 
towards the Sun. Based 
on [59] 
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Fig. 8.23. Schematic repre- 
sentation of the various cur- 
rent systems in the mag- 
netosphere and ionosphere. 
Reprinted from R.L. McPher- 
ron [352], in Introduction to 
space Physics (eds. M.G. Ki- 
velson and C.T. Russell), 
Copyright 1995, with kind 
permission from Cambridge 
University Press 



the Sun. The closure of the ionosphere-magnetosphere circuit in the equato- 
rial plane is suggested to be radial in the dusk-to-dawn direction. Note that 
the dusk-to-dawn auroral zone electric field is reversed if mapped outward, 
in agreement with the dawn-to-dusk electric field driven by the solar wind 
and also with the cross-tail electric field. Thus in the magnetosphere, the 
electric field and currents are antiparallel, forming the generator proposed in 
Fig. 8.20, while they are parallel in the ionospheric load. 

Ring Current. Part of the ring current is also involved in the magnetosphere 
ionosphere coupling. It flows near dusk in the equatorial magnetosphere and 
is closed through the ionosphere by field-parallel currents. Part of the tail 
current is diverted into the ionosphere by field-aligned currents, forming the 
substorm current. The substorm current, as its name suggests, plays an im- 
portant role in geomagnetic and auroral activity. These currents, together 
with the current systems discussed so far, are summarized in Fig. 8.23. 

A summary of the entire magnetospheric current system and hints on 
many open questions is given in the articles in [384] . 

8.3.4 The Plasmasphere 

The plasmasphere is dominated by a dense and cold plasma of ionospheric 
origin, as is evident from the high ratio and the existence of other 

ion species such as He“^, and which cannot be found in the 

completely ionized solar wind. Spatially, it coexists with the radiation belts, 
extending up to heights of about 3 to 5 Earth radii. The particles have ener- 
gies close to 1 eV, and the density varies between 10^ cm“^ at about 1000 km 
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Fig. 8.24. Cross-section through the 
plasmasphere in the equatorial plane. 
The location of the plasmapause, as 
determined from Whistler observa- 
tions, is indicated for different times 
corresponding to different levels of 
geomagnetic activity. Based on [87] 



and 10-100 cm“^ at the outer boundary of the plasmasphere. The plasma- 
sphere is filled from the ionosphere by the polar wind. A comprehensive review 
of the properties of the plasmasphere can be found in [319]. 

The plasmapause as the relatively sharp outer boundary of the plasmas- 
phere was first proposed from the properties of Whistler waves in the magne- 
tosphere. Figure 8.24 shows the location of the plasmapause in the equatorial 
plane for three different times corresponding to different levels of geomagnetic 
activity. The bulge at the duskside is a persistent feature, although it can ro- 
tate somewhat in local time, depending on magnetospheric conditions. The 
plasmasphere - and with it the plasmapause - basically is a field-aligned 
structure. It can be traced from the equatorial plane down to the ionosphere. 

Figure 8.25 shows the plasma density plotted versus the L-shell parameter 
for the nightside plasmasphere for different levels of geomagnetic activity. 




Fig. 8.25. Variation of the nightside 
plasmapause with geomagnetic activity. 
Based on [96] 
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With increasing geomagnetic activity, the plasmasphere shrinks and its outer 
boundary becomes more pronounced. 

The plasmasphere corotates with the Earth, leading to an electric induc- 
tion field Ecorot = —{u: X r) X B{r). This field is smaller than the electric 
field driving the current system in the magnetotail; the plasmapause as the 
outer boundary of the plasmasphere separates these two current systems. 

8.3.5 The Geosphere 

The plasmasphere is embedded in the geosphere, a highly variable region 
filled with a hot plasma of low density. The plasma inside the geosphere has 
two sources, the ionosphere and the solar wind, as is evident from the com- 
position. Figure 8.26 shows the size and location of the geosphere together 
with the three main current systems. In the upper panel, the familiar merid- 
ional cross-section is given. The middle panel corresponds to a view from high 
above the North Pole towards the equatorial plane, and in the lower panel 




Fig. 8.26. Plasma regimes in the mag- 
netosphere with the darkness of the 
shading indicating the plasma density. 
Views from the duskside to the dawn 
(upper panel) ^ from pole to equator 
(middle panel) , and from the subsolar 
point tailwards (lower panel). The ar- 
rows give currents. Based on [363] 
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two cross-sections with views from the subsolar point tailwards are shown. 
The shading indicates the plasma density, the arrows indicate the Chapman- 
Ferraro current in the magnetopause, the tail current across the tail in the 
equatorial plane, and the ring current around the Earth associated with the 
radiation belt particles. 

The plasma density in the geosphere is much lower than in the plas- 
masphere; it is also lower than in the outer magnetosphere. Evidence for a 
contribution of the ionospheric plasma to the geosphere again comes from the 
presence of 0"^ and other heavier ions. The dominant species, H+ and He^+, 
in number are roughly independent of solar and geomagnetic activity, while 
their energy as well as the relative amount of ionospheric ions dramatically 
increases with increasing geomagnetic activity. Thus the solar wind ions 
and He^“^ are fed into the geosphere at a roughly constant rate while geo- 
magnetic activity strongly enhances the energy imparted to these particles as 
well as the density of the ionospheric component. In addition, the ionospheric 
outflow into the geosphere tends to be larger by up to a factor of 4 if the 
interplanetary magnetic held has a northward component, that is for a closed 
magnetosphere (see Sect. 8.4). 

8.3.6 The Outer Magnetosphere 

The outer magnetosphere is dominated by solar wind plasma. The basic 
mechanism for feeding plasma into the magnetosphere is reconnection. The 
magnetopause therefore is not an unpenetrable boundary separating two com- 
pletely decoupled systems. Instead, plasma transfer takes place almost every- 
where along the magnetopause. Since the fleld-lines of the magnetopause con- 
verge at the polar cusps, the high- latitude ionosphere is directly influenced 
by the solar wind plasma penetrating through the magnetopause. 

Even the simplest models of the magnetosphere had predicted an direct 
access of solar wind at the polar cusps. But the plasma in some proper- 
ties, in particular in thermal energy, is different from the solar wind plasma: 
since it comes from the magnetosheath, the temperature is higher than in 
the solar wind because the latter had been slowed down at the bow shock. 
This downward plasma flow can be detected at latitudes of about 78° for 
about ±3 h around local noon, i.e. exactly below the cusps, as an increase in 
electron density and temperature. While the electrons form a narrow beam 
penetrating downwards through the cusps, the ions are spread back towards 
the tail. This is caused by an EJ x B drift in the dawn-to-dusk electric field. 
Since the electrons have a very short travel time, their displacement by this 
drift is negligible. The ions, on the other hand, are much slower and there- 
fore are affected by the drift. Their displacement increases with increasing 
travel time, i.e. with decreasing energy. However, to penetrate the magneto- 
sphere efficiently, the particles must already gyrate around the magnetic held 
lines bordering the cusp. Thus, the process is far more efficient if the held 
lines are not closed but open: they do not connect the cusp regions of the 
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two hemispheres but connect to the interplanetary magnetic field. Then the 
magnetosphere is called an open magnetosphere. 



8.4 The Open Magnetosphere: Reconnection Applied 

In the early days of magnetospheric research two different explanations for 
the entry of solar wind plasma into the magnetosphere were offered: the con- 
cept of an open magnetosphere [139] where plasma and fields are exchanged 
between the magnetosphere and the interplanetary medium by reconnection 
and convection as opposed to the viscous interaction model [15] where par- 
ticles diffuse across the magnetopause of a closed magnetosphere. Since the 
entrance of solar wind plasma into the magnetosphere strongly depends on 
the orientation of the interplanetary magnetic field, namely a southward com- 
ponent, evidence is in favor of the open magnetosphere. 

In an open magnetosphere plasma transfer across the magnetopause is 
due to reconnection and thus requires an X-point configuration or neutral 
line where fields of opposite polarity meet. This is most likely to occur at the 
dayside magnetopause if the interplanetary magnetic field has a southward 
component. Then the magnetopause is a rotational discontinuity while at 
times of a northward interplanetary magnetic field it is a tangential discon- 
tinuity completely separating the interplanetary and the planetary plasmas. 
The corresponding configurations are an open and a closed magnetosphere. 

Compared to other astrophysical objects, such as solar flares, the magne- 
tosphere is the only plasma laboratory where reconnection can be studied di- 
rectly: at the day side magnetosphere flux transfer events give direct evidence 
for reconnection, in the tail reconnection can be observed in the formation 
of substorms. The evidence for these reconnection processes is summarized 
in [463]. 

8.4.1 Convection of Plasma Into the Magnetosphere 

Figure 8.27 sketches the dynamics of an open magnetosphere: solar wind 
convects the interplanetary magnetic field lines towards the magnetopause. 
If the interplanetary magnetic field has a southward component (open mag- 
netosphere), interplanetary magnetic field line 1' eventually merges (or re- 
connects) with the planetary field line 1 in a diffusion region in the dayside 
magnetosphere. Thus two mixed planetary/interplanetary field lines result (2 
and 2') which are convected tailwards with the solar wind, eventually becom- 
ing field lines of the geomagnetic tail (5 and 5'). The F-region ionospheric 
plasma joins this anti-sunward fiow since it is still magnetically connected 
to the convected field lines (see also the small inset in Fig. 8.27). During 
this anti-sunward motion of field lines, plasma from the magnetosheath is 
convected into the tail, first filling the mantle and later also moving down to 
lower latitudes, as indicated by the dashed lines in Fig. 8.10. If this process 
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Fig. 8.27. The convection of plasma into the magnetosphere. The numbered field 
lines show a succession of configurations of an interplanetary magnetic field line 1' at 
the front of the magnetosphere. Field lines 6 and 6' reconnect in the tail and return 
to the dayside at lower latitudes. The small figure shows the resulting convection 
pattern of the plasma in the northern high-latitude ionosphere: an anti-sunward 
fiow in the polar cap and a return flow at lower latitudes. The shaded area is the 
auroral oval. Reprinted from W.J. Hughes [240], in Introduction to Space Physics^ 
Copyright 1995, with kind permission from Cambridge University Press 



were to continue indefinitely, the entire geomagnetic field would soon be con- 
nected with the interplanetary field and magnetic flux would pile up in the 
tail. Since this is not observed, the magnetic flux must be returned to the 
closed magnetic held. This is achieved on the nightside magnetosphere: as the 
mixed held lines are pushed further towards the equatorial plane, a X-point 
results in the tail’s plasmasheet where lines 6 and 6' meet. Here reconnection 
sets in, forming a closed geomagnetic held line and a purely interplanetary 
magnetic held line. Magnetic tension will relax the geomagnetic held lines (7, 
8), which in time will return to the dayside magnetosphere at lower altitudes 
leading to a sunward flow of magnetic flux in the ionosphere. Thus a return 
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flow results. Magnetic tension also allows the interplanetary field line 7' to 
shorten and therefore to be pulled outwards through the geomagnetic tail. 

This picture is grossly simplified since in reality the entire process will be 
essentially non-steady. Thus although the time-averaged reconnection rates 
at the dayside magnetopause and in the tail must be equal, at any given time 
they can be quite different. Nonetheless, there is observational evidence for 
such a process to occur. In the late 1950s, it was realized that the plasma 
flow in the polar and auroral ionospheres must map outward and be related 
to a magnet ospheric flow pattern. Magnetometer measurements indicated a 
plasma flow over the polar regions from noon to midnight (points 1-6 in the 
inset in Fig. 8.27) and a flow back towards the dayside at lower latitudes 
(points 7-9). This process also can be described as a solar wind dynamo and 
is the driving process in the magnetosphere-ionosphere coupling and thus 
the vertical exchange of matter and energy. The plasma carried with the 
convected field lines leads to an electric convection field of the order of 50- 
100 kV, the dawn-to-dusk field. Combined with the corotating field of the 
plasmasphere, an asymmetric field results that also contributes to the partial 
ring current. 

The pattern described in Fig. 8.27 is roughly stationary in local time, i.e. 
in a frame of reference with a fixed Sun-Earth line. An observer on Earth 
rotates underneath this flow pattern, seeing it as a diurnal magnetic field 
variation. Since the flow pattern resembles a thermally driven flow cell, it has 
been termed a convection pattern, although it is not thermally driven. The 
lower panel in Fig. 8.27 shows this flow pattern on the duskside. Since the 
magnetic field is frozen into the plasma, we can map back the plasma flow to 
a pattern of motion of a magnetospheric field line which is swept across the 
polar cap and then returns to the dayside magnetosphere at lower latitudes, 
which is exactly the motion of the field line shown in the upper panel of 
Fig. 8.27. 

Note that Fig. 8.27 is meant as a schematic only. The details of the recon- 
nection process strongly depend on the local direction of the interplanetary 
medium. Also, correspondingly, the results of this process, in particular the 
amount of field lines convected into the tail and the resulting ionospheric 
currents, are highly variable. Magnetospheric parameters influenced by the 
properties of the interplanetary magnetic field, in particular its southward 
component, include the cross-tail electric field, the ring current and the au- 
rora, all of them smaller or weaker in the case of a non-southward interplan- 
etary magnetic field. Some empirical relations between solar wind conditions 
and geomagnetic parameters, such as the Dgt index have been suggested. The 
most global relation is concerned with the total solar wind power input into 
the magnetosphere [403,539], 
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Fig. 8.28. The Earth with an idealized dipole field, in a northward (left) and a 
southward {right) magnetic field. Reprinted from A. Brekke, Physics of the upper 
polar atmosphere [59], Copyright 1997, with kind permission from Wiley 



where B[p is the strength of the interplanetary magnetic field, 0 its angle with 
respect to the z-axis, and r^p the magnetopause distance. Equation (8.31) 
gives the flux density of magnetic energy (t^sowi ^ip/2/^o) into the approximate 
surface area of the magnetopause. 

In (8.31) a northward-pointing interplanetary magnetic held corresponds 
to 0 = 0 and thus Fin = 0* in the case of a northward held, no energy is 
transferred into the magnetosphere and the magnetosphere is closed. At most 
times, however, 0 will be different from zero and thus some amount of energy 
is transferred to the magnetopause. 

Figure 8.28 illustrates this energy flow, which depends on the north- 
south component of the interplanetary magnetic fleld. The terrestrial held 
is drawn as an idealized dipole; the interplanetary fleld has a northward 
component in the left panel and a southward component in the right panel. 
For the northward fleld, the magnetosphere is closed and the energy flux 
(Poynting vector) E x B = — (itgowi x B) x B is parallel to the fleld lines: no 
energy enters the magnetosphere. For a southward interplanetary magnetic 
fleld (right panel), the Poynting vector points into the magnetosphere and 
energy enters everywhere. 

The magnetic energy is only a small portion of the total solar wind energy 
because most energy is contained in the flow. The maximum ratio (for 0 = 
180°) between the imparted magnetic energy and the bulk kinetic energy of 
the solar wind can be approximated by 
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where Ma is the Alfvenic Mach number. For typical solar wind conditions, 
Ma « 7. 
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The details of the reconnection process at the magnetopause are still 
being debated; in fact, this problem is at the cutting edge of space physics. 
For summaries see, for example, [210,241,399,463]. 

8.4.2 Flux Transfer Events 

Direct evidence for reconnection between the planetary and interplanetary 
magnetic fields is obtained from in situ observations at the dayside mag- 
netopause. These observations are not related to the convection pattern 
sketched above, but directly confirm the exchange of plasma and field between 
the magnetosphere and the interplanetary medium. As a satellite passes 
through the magnetopause, short events in the magnetic-field, plasma, and 
particle data provide evidence for such fiux exchange. These events therefore 
are termed fiux transfer events; for observational evidence see, for exam- 
ple, [210,277]. The earliest observations were based on magnetic-field data 
only. A fiux transfer event can be identified as two consecutive short ex- 
cursions of the magnetic field component normal to the magnetopause, first 
to positive values, later to negative values, and afterwards returning to the 
undisturbed value around zero, if the satellite is in the northern hemisphere. 
The sequence is opposite for an observer in the southern hemisphere; see 
Fig. 8.29. These signatures are interpreted as an isolated magnetic fiux tube 
connected through the magnetopause to the geomagnetic field and convected 
northwards across the satellite by the solar wind; see Fig. 8.30. The composi- 
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Fig. 8.29. Magnetic field data during a magnetopause crossing, with evidence for 
flux transfer events indicated by the dashed vertical lines. Reprinted from C.T. 
Russell and R.C. Elphic [450], Geophys. Res. Lett. 6, Copyright 1979, American 
Geophysical Union 
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Fig. 8.30. Sketch of a magnetic flux tube 
through the magnetopause. The motion of 
such a flux tube leads to the signature of a 
flux transfer event in the magnetic field data. 
Reprinted from C.T. Russell and R.C. El- 
phic [450], Geophys. Res. Lett. 6, Copyright 
1979, American Geophysical Union 



tion of the plasma and the energetic particles inside the flux tube were both 
indicative of a magnetospheric origin in the example illustrated here. Since 
flux transfer events require reconnection at the dayside magnetosphere, these 
events are observed at times when the interplanetary magnetic fleld has a 
southward component. 

8.4.3 Release of Accumulated Matter: Substorms 

The release of matter convected into the magnetotail’s plasmasheet causes 
magnetic substorms which are seen in magnetic fleld variations, in particular 
in the AE index, as well as the aurora. The primary location of energy storage 
is the tail; the physical mechanism for the energy release is reconnection in 
the plasma sheet. The basic process can be likened to a dripping fountain: a 
drop at the outflow forms by the interplay between gravity pulling the water 
down and surface tension keeping it up. If enough water has been collected 
at the outflow, gravity takes over and the drop falls. In a magnetospheric 
substorm, a plasmoid in the magnetotail takes over the role of the drop: solar 
wind drag pulls on the magnetosphere, feeding plasma and energy into it. 
As the plasmoid grows, a magnetic neutral point forms close to the Earth. 
Here reconnection sets in, expelling the plasmoid tailwards and accelerating a 
small amount of plasma towards the Earth, causing the aurora at the Earth’s 
high-latitude nightside and the accompanying geomagnetic disturbance. 

This scenario is sketched in more physical terms in the Hones substorm 
model as shown in Fig. 8.31. Panel 1 gives the quiet-time conflguration of 
the magnetosphere, with an X-point at a radial distance of about lOOrE- The 
fleld line at this point is the last closed fleld line: in the direction of the Earth, 
all fleld lines are closed; at larger distances, all fleld lines are open. As recon- 
nection sets in at this X-point, the energetics of the plasma sheet are changed 
so that a second X-point forms much closer to the Earth (panel 2). As more 
energy is fed from the solar wind into the magnetosphere, reconnection at 
this inner X-point continues and a plasmoid is formed between the inner and 
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Fig. 8.31. Sequence of events leading to a magnetospheric substorm (see text). 
Based on E.W. Hones [228], in Magnetic reconnection (ed. E.W. Hones), Copyright 
1984, American Geophysical Union 



outer X-points (panels 3-6), until finally the plasmoid becomes detached at 
the inner X-point (panel 7) and is accelerated, leaving the magnetotail (panel 
8). The bulk of the energy fed from the solar wind into the magnetosphere is 
contained inside the plasmoid and thus is fed back into the solar wind. Only 
a small amount is converted to kinetic energy of plasma moving towards the 
Earth. As this plasma interacts with the high-latitude ionosphere, it causes 
an aurora and enhances the auroral electro jet. The tail slowly fills with new 
solar wind plasma (panels 9 and 10), until the initial configuration (panel 1) 
is restored and the cycle starts anew. 

Observational evidence for this scenario, together with a shortened ver- 
sion of the Hones substorm model, is presented in Fig. 8.32. The left panel 
shows the model and the location of the spacecraft; the right panel shows a 
superposed epoch analysis of the tailward plasma velocity, the total magnetic 
field, the north-south excursion Bz of the magnetic field, the fiux of >30 keV 
electrons in geosynchronous orbit, and the auroral-electro jet index AL. The 
dashed line marks the first occurrence of the high-speed fiow. With the ar- 
rival of this fiow, the spacecraft is engulfed by a region of lower magnetic field 
strength and the north-south component of the field rotates as the plasmoid 
travels across the spacecraft. The two lower panels indicate the characteristic 
consequences of substorms: an increase in electron fiux and a sudden depres- 
sion of the auroral electro jet. The onset of the substorm is about 30 min 
before the passage of the plasmoid in the tail - the delay is due to the fact 
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Fig. 8.32. Model and observations during plasmoid formation in the magnetotail. 
From M. Scholer [463], Copyright 2003, Springer-Verlag, based on [24] 



that the spacecraft does not observe the plasmoid during formation but only 
after it has propagated a considerable distance through the tail. 

The plasma travelling towards the Earth during the discharge of the mag- 
netotail leads to the substorm current, which has already been shown in 
Fig. 8.23. It is associated with the tail current, which during a substorm, 
is partly diverted as a field-parallel current towards the ionosphere in the 
evening sector, continues through the ionosphere as an auroral electro jet, 
and flows back towards the tail as a fleld-parallel current. The excursion of 
the tail fleld happens when, during the onset of reconnection, the tail field 
collapses. This current system is called a substorm current wedge because a 
projection of the current system onto the equatorial plane takes the form of a 
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Fig. 8.33. Energy flux from the 
solar wind into the magnetosphere. 
Reprinted from S.-I. Akasofu [4], EOS 
70, Copyright 1989, American Geo- 
physical Union 



wedge. The opening angle of the wedge is typically 70°, the current is about 
2 X 10^ A, and the wedge extends from the Earth into the tail for about 5rE. 

Figure 8.33 offers a simple illustration of this dependence of geomagnetic 
activity on solar wind flow and the southward component of the interplan- 
etary magnetic field in terms of the superimposition of two energy fluxes. 
The solar wind feeds energy continuously into the magnetosphere. The en- 
ergy partly is stored as magnetic field energy in the tail (small bucket to the 
right) and partly is converted to geomagnetic activity (outflow to the left). 
During weak geomagnetic activity as well as during large storms the solar 
wind energy is fed directly into the magnetosphere and ionosphere. At times 
of moderate geomagnetic activity, however, the energy is stored in the mag- 
netosphere before it is released. Solar wind energy is fed more efficiently into 
the magnetosphere if the latter is open, i.e. if the interplanetary magnetic 
field has a southward component, indicated by the handle at the outflow. 
Thus, more energy is available for release in the form of geomagnetic activ- 
ity. This release, however, is not continuous, but energy is stored over a time 
period of typically an hour and then liberated abruptly in a substorm. 

During the sequence shown in Fig. 8.31, an observer on Earth sees charac- 
teristic changes in the aurora. As the plasmoid in the tail grows, the auroral 
oval slowly expands equatorwards with the aurora still being a quiet arc. As 
reconnection sets in at the inner X-point, the initial auroral arc brightens, 
exhibits more structures and fast changing features, and moves rapidly equa- 
torwards. During the recovery, a rather quiet auroral arc or curtain contracts 
to the size of the initial auroral oval. 

More detailed discussions about the magnetosphere, its plasma sources 
and losses and its variability are given in [234,242]. 
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8.4.4 Closed, but Only Almost Closed 

Prom the above discussion, it may appear that all the interesting things 
happen when the interplanetary magnetic field has a southward component 
and that the magnetosphere is rather boring at times of a northward field. 
Judged from the aurora as a visible indicator of geomagnetic activity, this 
is simultaneously true and false: true in that the aurora is rather quiet and 
limited in space when there is a northward field, and false in that a special 
kind of aurora, the theta aurora, can be observed. 

There is also evidence for reconnection between interplanetary magnetic 
field lines and the geomagnetic field, and thus the magnetosphere is not en- 
tirely closed. Obviously, this reconnection cannot happen at the nose of the 
magnetosphere, because there the terrestrial and interplanetary magnetic 
fields are parallel. However, as the interplanetary field lines are convected 
over the magnetosphere, they encounter configurations suitable for recon- 
nection at the cusps where the fields of the magnetosheath and the lobes 
are antiparallel; see Fig. 8.34. After reconnection, the poleward portion of 
the fiux tube is convected tailwards by the solar wind. This is similar to 
the dayside reconnection in a southward interplanetary magnetic field. In 
the dayside portion of the field line, on the other hand, plasmas from the 
magnetosphere and the magnetosheath mix and the field line sinks into the 
magnetosphere. The observational signature of these reconnection events is 
unidirectional electron streams and an outfiow of ionospheric O"^. These ob- 
servations also suggest that reconnection does not occur simultaneously in 
both hemispheres. 




Fig. 8.34. Reconnection in 
the dayside magnetosphere 
for a northward interplane- 
tary magnetic field. Reprinted 
from Fuselier et al. [177], J. 
Geophys. Res. 106, Copyright 
2001, American Geophysical 
Union 
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8.5 Geomagnetic Disturbances 

8.5.1 Daily Variations 

The magnetogram of a geomagnetically quiet day shows systematic varia- 
tions in all three field components. The most pronounced variation can be 
observed around local noon. These variations are regular excursions, which 
are repeated each day. Their direction and magnitude depends on the geo- 
magnetic latitude of the observer. These Sq, or solar quiet, variations are 
related to the ionospheric sq current system (Fig. 8.19) 

Occasionally, the Sq variations are enhanced by a solar fiare: since its hard 
electromagnetic radiation leads to a stronger ionization of the dayside iono- 
sphere, the ionospheric current system is enhanced. In this case the change 
in magnetic field is called the sfe variation (sfe: solar fiare effect). 



8.5.2 Geomagnetic Indices 

For a quantitative description of the geomagnetic field the K and A indices 
are used. The basis for these indices are the magnetograms. The K index is a 
quasi-logarithmic number between 0 and 9 determined at the end of specified 
3-h intervals as the maximum deviation of the observed magnetic field from 
the expected quiet field. It is determined for each of the three magnetic 
field components separately. The largest of the maxima is converted to a 
standardized K index taking into account the geomagnetic properties of the 
observation site. Thus K indices of different stations, in particular of stations 
at high and low latitudes, can be compared and can be combined to give a 
planetary K index. 

At individual stations, the eight daily K indices are linearized to give an 
a index and than averaged arithmetically to give the A index describing the 
daily averaged magnetic activity. The construction of the K and A indices 
and the global network of observatories is described in [353]. 

Geomagnetic disturbances generally are described by two indices. Mag- 
netic storms can be quantified by the Dgt index which gives the excursion of 
the equatorial H component compared with quiet times. Physically, the Dgt 
index is related to the ring current. At high latitudes, an AE index is used, 
related to the auroral electro jet. It is also determined from the excursion of 
the H component compared with quiet times. Both indices are determined 
globally by combining different observatories at comparable geomagnetic lat- 
itudes but different longitudes. The index with the longer time record is the 
AA index which also uses the difference between observed and expected hor- 
izontal components but now at mid-latitudes. Owing to its long record, it is 
often used for correlative studies; however, physically more significant and 
easier to understand are the AE and Dgt indices. 

On these periodic quiet time variations, irregular disturbances on differ- 
ent time scales are superimposed. Fluctuations with time scales below about 
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0.2 s are waves, fluctuations with time scales between 0.2 s and 600 s are pul- 
sations of the magnetosphere, and variations with time scales above 10 min 
are geomagnetic disturbances. These geomagnetic disturbances can be ob- 
served worldwide; however, their amplitudes generally are largest at high 
geomagnetic latitudes and smaller, or even vanishing, at low latitudes. 

8.5.3 Geomagnetic Pulsations 

Magnetospheric waves and pulsations are phenomena which affect the entire 
magnetosphere. For instance, magnetic held fluctuations observed from the 
ground are highly correlated to fluctuations in the electric held observed from 
a satellite in geostationary orbit [186]. Nonetheless, amplitudes might vary 
with position. Such fluctuations are called geomagnetic pulsations. Continu- 
ous pulsations are grouped from Pci to Pc5 according to their periods, with 
Pci starting at periods of 0.2 s and Pc5 ending at 600 s. These waves are 
ultra-low frequency waves. Geomagnetic pulsations can be quite regular dur- 
ing quiet geomagnetic periods and become quite irregular during geomagnetic 
storms. Then they are termed Pil and Pi2. 

Geomagnetic pulsations act as coupling devices between different parts 
of the magnetosphere and ionosphere because they transport energy and 
information. Physically, the quiet time pulsations (Pcl-Pc5) best can be 
interpreted in terms of a cavity vibration of the entire magnetospheric cavity. 
The irregular pulsations Pi, on the other hand, seem to be Alfven waves. 

Wave generation requires an energy input. The departure from the equi- 
librium of the plasma and the field that drives the waves at least at quiet 
times appears to be related to the large scale convective flux. Thus the energy 
input is at the dayside magnetosphere. The irregular pulsations are driven by 
sporadic events, for instance the compression of the frontside magnetosphere 
at the beginning of a sudden commencement also causes an oscillation of the 
magnetospheric cavity [289]. 



8.5.4 Geomagnetic Storms 

Geomagnetic disturbances are also called magnetic storms or substorms, de- 
pending on their temporal and spatial extent. Magnetospheric substorms are 
the most frequent type of geomagnetic activity. Its most obvious manifesta- 
tion is the sudden explosion of a quiet auroral arc to more brilliant colors and 
moving structures. Over a period of an hour, they develop through an orderly 
sequence that depends on time and location. Simultaneously, a magnetome- 
ter on the ground below the aurora will record intense disturbances caused 
by the electric currents accompanying the aurora. These auroral electrojets 
are roughly parallel to a geomagnetic parallel circle, flowing at a height of 
about 120 km in concentrated channels of high conductivity produced by the 
same particles that generate the auroral emission. The disturbances in the 
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Fig. 8.35. The variation of the Dst index for a large magnetic storm 



magnetic field are in the range 200 nT-2000 nT, they typically last between 
1 h and 3 h, and are most pronounced at high geomagnetic latitudes. 

If the coupling of matter and energy from the solar wind into the magne- 
tosphere is stronger and lasts longer, a magnetic storm develops. Its temporal 
development can best be seen in the Dst index. Figure 8.35 shows the varia- 
tion of the Dst index for a large geomagnetic storm. The storm often begins 
with a sudden increase in the magnetic field. This sudden commencement 
may last for many hours. This initial phase is followed by a rapid and some- 
times highly disturbed decrease in Dst, which defines the storm’s main phase. 
Subsequently, Dgt recovers, first rather quickly, later more slowly. A storm 
lasts between 1 and 5 days with the initial phase anything up to 1 day, the 
main phase normally about 1 day, and the recovery phase lasting for several 
days. The distribution of storm magnitudes obeys a power law: storms with 
Dst between 50 nT and 150 nT occur about once in a month. Disturbances 
with Dst between 150 nT and 300 nT occur several times a year, while only a 
few storms with Dst above 500 nT can be observed over an entire solar cycle. 

The phases of a geomagnetic disturbance can be understood as follows: the 
increase in the magnetic field strength at the beginning of the disturbance can 
be attributed to the compression of the magnetosphere as the magnetopause 
is pushed inward by the increased solar wind speed. The decrease in the field 
strength during the main phase of the storm is due to an increase in the 
ring current, which creates a magnetic field opposite to the terrestrial one. A 
typical current density in the undisturbed ring current is about 10“^ A m~^, 
mainly carried by particles with energies between 10 keV and 100 keV at a 
height between 3 te and 6 te- During a strong magnetic storm, particles are 
injected from the plasma sheet in the magnetotail into the radiation belts, 
enhancing its density by an order of magnitude on time scales as short as 
10 min. This enhanced ring current then reduces the magnetic field measured 
on the ground. 
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Magnetic storms can be caused by fast solar wind streams or also by 
transient disturbances, such as interplanetary shocks and magnetic clouds. 
The same pattern as for recurrent solar wind disturbances emerges: the ge- 
omagnetic activity increases with increasing change in solar wind flux and 
is stronger if the interplanetary magnetic held or the held at the leading 
edge of the magnetic cloud has a southward component [196,521], that is 
the magnetosphere has an open conflguration. As a rule of thumb, an intense 
geomagnetic storm requires a southward component of the interplanetary 
magnetic field of more than 10 nT for at least 3 h. 

8.5.5 Geomagnetic Activity on Longer Time Scales 

Geomagnetic disturbances are not distributed uniformly in time. Instead, 
characteristic dependences can be observed which directly point to the phys- 
ical mechanisms responsible for them. In the late 1930s Chapman and Bar- 
tels [94] showed that the number of geomagnetic disturbances is related di- 
rectly to the number of sunspots and thus to solar activity. Figure 8.36 shows 
this close relation for the last 100 years using yearly averages of the sunspot 
number (lower curve) and the A A index (upper curve). Note that geomag- 
netic activity does not vanish during solar minima: while it is strongest during 
solar maximum due to the large number of transient disturbances, the ge- 
omagnetic activity during solar minima mainly is caused by corotating fast 
solar wind streams. This recurrent geomagnetic activity therefore shows a 
periodicity of 27 days. 

In addition, there is an annual variation with enhanced geomagnetic ac- 
tivity during the equinoxes. Figure 8.37 shows the Dgt index plotted versus 
time for 16 solar rotations during the 1974 solar minimum. Strong deviations 
of the Dst index to lower values are indicative of geomagnetic activity. For 
a better identification, these periods are blackened. Two systematic varia- 
tions can be identified in this figure. Most obvious is the recurrence of the 
geomagnetic disturbances during each solar rotation. These disturbances are 
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Fig. 8.36. Variation of sunspots and geomagnetic activity with the solar cycle 
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Fig. 8.37. Variation of the Dst index versus time for 16 solar rotations during 
the 1974 solar minimum. Reprinted from N.U. Crooker and G.L. Siscoe [115], in 
Physics of the Sun, vol. Ill (eds. P.A. Sturrock, T.E. Holzer, D.M. Mihalas, and K. 
Ulrich), Copyright 1986, with kind permission from Kluwer Academic Publishers 



related to fast solar winds, which can be observed best during the solar min- 
imum. But this recurrence is not observed for all 16 rotations: in the spring, 
the strongest geomagnetic disturbances are observed in the middle of each 
rotation, while in the autumn they are at the beginning of the rotation. In ad- 
dition, during summer and winter the disturbances are weak, while in spring 
and autumn they are more pronounced. 

To understand this change in pattern between the two equinoxes let us 
first look at one of these disturbances, the one beginning in the middle of 
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Fig. 8.38. Variation of solar wind density, speed, magnetic flux density, the north- 
south component of the interplanetary magnetic field with negative values indi- 
cating a southward interplanetary magnetic field, and geomagnetic activity over 
solar rotation 1921. The dashed vertical lines link the times of a southward ex- 
cursions by the interplanetary magnetic field and enhanced geomagnetic activity. 
Reprinted from L. Burlaga and R.R Lepping [71], Planet. Space Sci 25, Copyright 
1977, American Geophysical Union 



rotation 1921 in Fig. 8.37. Figure 8.38 shows the density and speed of the 
solar wind, the flux density and north-south component of the interplanetary 
magnetic field, and the AE (auroral electrojet) index as a measure of geomag- 
netic activity related to aurorae and substorms. For solar rotation 1921 two 
fast streams can be identified, one starting in the data gap on 15 January, the 
other starting on 25 January. Immediately before the beginning of this stream 
the increase in plasma and magnetic fiux density indicates the compression 
region in front of the fast stream. The envelope on the AE index traces the 
solar wind speed, thus changes in the solar wind are related to geomagnetic 
disturbances. But even at times of rather constant solar wind speed there 
can be intense although short geomagnetic disturbances (for instance the one 
on the evening of 18 January). These disturbances are strongest when the 
interplanetary magnetic field has a southward component. 

With this information we can go back to the interpretation of Fig. 8.37. 
The fact that geomagnetic disturbances are recurrent is related to the ex- 
istence of two recurrent fast solar wind streams, one at the beginning and 
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the other in the middle of each solar rotation. The geomagnetic effectiveness, 
on the other hand, depends on whether the interplanetary magnetic field 
has a southward component or not, i.e. whether the magnetosphere is open 
or closed. As discussed earlier, an open magnetosphere also means an eas- 
ier exchange of energy and matter between the interplanetary medium and 
the terrestrial environment, which is essential to drive geomagnetic activity. 
But the annual variation of geomagnetic effectiveness does not indicate that 
the fast streams change polarity during the course of the year. Since they 
are related to the coronal structure and magnetic field, their polarity stays 
constant. The change occurs in the position of the Earth and the terrestrial 
magnetic field relative to the interplanetary magnetic field. The solar wind 
and thus the interplanetary magnetic field is fixed with respect to the Sun’s 
equatorial plane. The plane of ecliptic, which also defines the orbit of the 
Earth, is inclined by 7.2° with respect to this plane, with both planes inter- 
secting in the equinoxes while the Earth is above (below) the solar equator 
at the summer (winter) solstice. In addition, the axis of the Earth’s rota- 
tion is tipped at 23° to the ecliptic plane. The combined effects regulate the 
southward component of the interplanetary magnetic field with respect to the 
terrestrial field. If the magnetic field is directed away from the Sun, in spring 
it has a strong southward component in geomagnetic coordinates while in 
autumn it has a very weak southward or even a northward component. If 
the interplanetary magnetic field is directed inwards, the pattern is just the 
opposite. Thus a fast solar wind stream which has a high geomagnetic effec- 
tiveness in spring is less efficient in autumn, while a stream with an opposite 
direction of the interplanetary magnetic field shows the reverse pattern. At 
the solstice, in both streams the southward component is diminished, leading 
to a reduced geomagnetic effectiveness. 



8.6 Aurorae 

Aurorae, or polar lights, are a prime example of solar-terrestrial relation- 
ships. They also provide an interesting example of the development in geo- 
physical research. And they are simply beautiful and, depending on his habi- 
tat, have fascinated or frightened mankind since historical times. Recent re- 
views about the plasma physical aspects of aurora are given in [330,400]; 
the more popular aspects of aurora together with numerous examples are 
discussed in [54,60,142]. Internet resources on aurora are e.g. www.geo.mtu. 
edu/weather /aurora/, www.northern-lights.no/, www.polarimage.fi or 
www.pfrr.alaska.edu/aurora/INDEX.HTM, and on aurora prediction from 
satellite data sec.noaa.gov/pmap/. 

The aurora, also called the northern light, is a typical phenomenon of the 
high latitudes, where under suitable conditions (clear sky, no full moon), it 
can be observed almost constantly. Under normal conditions, the aurora is a 
colored arc extending roughly from east to west, changing its appearance in a 
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Fig. 8.39. Typical shapes of the au- 
rora: homogeneous arc HA, rayed arc 
RA, homogeneous bands HB, rayed 
band RB, rays R, coronae C, and 
draperies D. Reprinted from A. Brekke 
and A. Egeland [60], Northern lights^ 
Copyright 1983, Springer- Verlag 



typical pattern during the night. Under geomagnetically disturbed conditions, 
the aurora brightens, becomes highly structured, moves equatorwards across 
the sky, and changes its appearance fast. Typical auroral structures are shown 
in Fig. 8.39, with the arcs and bands more typical of geomagnetically quiet 
conditions and the draperies, rays, and corona more often observed during 
geomagnetic activity. The aurora is less bright than the full moon, and thus 
although even in mid-Europe some aurorae occur each year, often they are 
difficult to detect because of a city’s counterglow in the sky. 

8.6.1 Historical Excursion 

Records of the aurora can be traced back for at least 2500 years. The an- 
cient Chinese described dragons winding in the sky, calling the aurora “flying 
dragons” . More detailed records date back to the Romans. Although made at 
low latitudes (Mediterranean), these observations describe the many shapes 
and colors normally only observed in the auroral oval. But the most frequent 
aurora in low latitudes is a reddish glow at the horizon, often misinterpreted 
as a burning farmstead or village. For instance, Seneca writes that the em- 
peror Tiberius sent troops to the village of Ostia because an aurora evoked 
the impression of the village being in flames. Such misinterpretation can be 
traced throughout history. Even in the twentieth century reddish glows have 
caused false alarms of distant forest fires. The ancient Greeks, too, saw the 
aurora and named it “chasmata” , which means frightening apparition. 

At middle and low latitudes the reddish color, combined with the small 
number of sightings, has led to interpretations of the aurora in terms of bad 
omens of war, famine, fire, and pestilence, or clerics interpreted it as a battle 
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Fig. 8.40. Fantastic illustration of an aurora observed from Bamberg, Germany, 
in December 1560. The flashing lights in the northern sky are interpreted as sparks 
from clashing swords in a heavenly battle 



in the heavens between good and evil, with rays appearing as swords, spears, 
or faculae. A typical example of such an interpretation is shown in Fig. 8.40 
for an aurora observed in 1560 from the German town of Bamberg. 

Since aurorae were observed only occasionally, such a superstitious inter- 
pretation in an end-of-the world mood was typical of mid-Europe. Cultures, 
which from the very same scholars were regarded as primitive (note that at 
the time of the woodcut shown in Fig. 8.40 the discovery of America was al- 
ready history), often had a healthier attitude towards these lights. One of the 
reasons obviously is the higher frequency of occurrence for people living at 
higher geomagnetic latitudes, taking away much of the horror and sometimes 
even encouraging the search for natural explanations. In the King's Mirror^ a 
Norse chronicle of 1259 as many as three alternative explanations are offered, 
which in the framework of the then world picture are quite reasonable: 

The men who have thought about and discussed these lights have guessed at 
three sources, one of which, it seems, ought to be true. Some hold that fire circles 
about the ocean and all the bodies of water that stream about on the outer side of 
the globe; and since Greenland lies on the outermost edge of the Earth to the north, 
they think it is possible that these fires shine forth from the fires that encircle the 
outer ocean. Others have suggested that during the hours of night, when the Sun’s 
course is beneath the Earth, an occasional gleam of light may shoot up into the sky, 
for they insist that Greenland lies so far out on the Earth’s edge that the curved 
surface which shuts out the sunlight must be less prominent there. But there are 
still others who believe (and it seems to me not unlikely) that the frost and the 
glaciers have become so powerful there that they are able to radiate forth these 
flames. 
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With the image of the Earth as a flat disk and the daily experience of glitter- 
ing water and gleaming glaciers, all three explanations are quite reasonable. 

Nonetheless, although aurorae were part of the daily experience, other 
cultures close to the northern Arctic Circle have offered less scientific and 
more mythological interpretations. Many Inuit tribes interpreted aurorae as 
torches in the hand of Gods leading the souls of their deceased or as the souls 
of the departed playing ball on the heavenly meadows. Other Eskimo tribes 
interpreted them as the dances of their Gods. A similar interpretation was 
offered by the Scotts about 2000 years ago (they called the aurorae ‘merry 
dancers’) or by the Aboriginals. Their neighbors, the Maori, interpreted the 
aurorae as fires set ablaze by their ancestors who, in their canoes, had drifted 
too far to the south and now had lightened fires to keep themselves warm. 

But even cultures that rather often saw aurorae were not immune against 
superstition. The North American native Indians, for instance, also inter- 
preted an aurora as a bad omen or fighting tribes. And even the Laps, who 
should be acquainted with the aurora rather well, interpreted it as a sign of 
violent death, as the souls of the victims. 

8.6.2 Beginning of the Scientific Analysis 

At the beginning of the scientific analysis, the aurora had been interpreted 
as the counterglow or reflection of a natural light source, for instance the 
reflection of the Sun, already below the horizon, from clouds or a flat surface 
of water, ice, or snow. Alternative interpretations included natural terrestrial 
sources such as volcanos, streams of lava, or bog fires. These explanations are 
particularly attractive for an aurora observed as a reddish glow just above 
the horizon, such as observed most often from mid-Europe. 

This line of thought survived for rather a long time, although in the middle 
of the eighteenth century Cavendish, using triangulation, found the heights 
of the aurora to be at about 100 km. Thus the aurora cannot be explained 
by reflection from clouds at much lower altitudes. A very systematic analysis 
of the heights of the lower edges of the aurora was performed by Stprmer. 
Figure 8.41 shows his results: aurorae are observed in a wide band of altitudes 
ranging from just above 80 km to more than 500 km. Nonetheless, most 
aurorae had their lower edge at an altitude between 90 km and 150 km. 

In the 1860s, a spectroscopic analysis of the aurora by Angstrom brought 
an end to all reflection theories. Since pressure in the high atmosphere is 
very low, the auroral emission consists of many forbidden lines, at that time 
unknown from laboratory experiments. But although these lines were iden- 
tified only in the 1920s, the limitation of auroral emissions to a few lines 
gave evidence against the reflection of a continuous solar spectrum. Thus the 
aurora had been identified as an independent phenomenon, originating in the 
discharge of excited gases. 

Hints at its origin had developed only slowly; in particular, the relation- 
ship between solar and auroral activity had long been debated. In 1730, 
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Relative Number of Aurorae 



Fig. 8.41. Height distribution of aurorae, 
based on [505] 



Lewis proposed that sunspots were responsible for auroral activity because 
the number of aurorae and their southward extent, which also leads to higher 
detectability and therefore higher frequency at lower latitudes, increases with 
sunspot number. Since it was difficult to establish a causal chain between 
these phenomena, Lewis’s hypothesis was disregarded. More than a century 
later it was revived by two different observations. In May 1859, Carrington 
observed a flare in white light, which not only gave rise to the first record of a 
flare but two days later also to a violent geomagnetic storm and strong auroral 
activity even at mid-latitudes. Carrington himself speculated on a relation- 
ship between the flare and the aurora; however, he also cautioned against 
hasty conclusions since one swallow does not make a summer. A more sys- 
tematic analysis was published in 1873 by H. Fritz, see Fig. 8.42, confirming 
the close correlation between sunspot number and aurora proposed about 
150 years earlier by Lewis. Fritz described the results of his analysis, which 
considered data from the past 100 years, as follows: 

The aurora is a periodical phenomenon and closely related to the formation of 
dark spots on the Sun. The times of the richest exhibition of spots on the central 
body of our planetary system are characterized by rich and magnificent light phe- 
nomena around the poles of our Earth, while times of sunspot minima correspond 
to rareness, weak development, and small spatial extent of aurora. During these 
times, in the mid-latitudes of our planet the aurora vanishes while during maxi- 
mum times it extends downward from both poles, occasionally even close to the 
equator. 

Because of this close relationship between aurora and solar activity, the fre- 
quency of aurorae at mid-latitudes can be used as a proxy for solar activity, 
in particular when historical times are concerned where no or only scattered 
records of sunspots are available. 
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Fig. 8.42. Correlation between sunspot number and frequency of aurora. Reprinted 
from A. Brekke and A. Egeland [60], Northern lights^ Copyright 1983, Springer- 
Verlag 



Fritz not only studied the frequency of aurorae but also their spatial dis- 
tribution, as did Muncke, Franklin, and Loomis before him. The distribution 
of auroral activity can be described in a map of isochasms (see Fig. 8.43); an 
isochasm is a line of equal frequency of an unusual or frightening apparition. 
The isochasms were found to be ovals around the geomagnetic pole with the 
maximum at geomagnetic latitudes around 70°. For the auroral oval around 
the North Pole, the southernmost extension of about 60° latitude is at about 
90° western longitude. Then the isochasm deviates northward from the par- 
allel and goes through Baffin Bay, around the southern tip of Greenland, 
crosses Iceland and the northern part of Spitzbergen, reaching the highest 
northern latitude at about 40° eastern longitude. Then it deviates southward 
from the parallel, turning back to its starting point on a route across the 
Siberian Ice Sea and just north of the Bering Strait. Farther north or south 
of this line, the aurora is less frequent. Nonetheless, the southernmost north- 
ern light recorded so far was observed on 15 September 1909, from Singapore, 
just a few degrees north of the equator. 

Thus the spatial distribution of aurorae somehow is governed by the ter- 
restrial magnetic field. A connection between aurora and magnetic field vari- 
ations had been observed even earlier. For instance, in 1749, Celsius and 
Hiorter published reports on the relationship between magnetic field distur- 
bances and the aurora, wondering “who would have thought that there is a 
relation between the aurora and the compass needle, and that the northern 
light, if moving southwards across the zenith, can cause a deviation of the 
magnetic needle by some degrees?” . 

8.6.3 Modern Interpretation 

A simple, though incomplete, analogy of our understanding of the aurora is 
the cathode-ray tube in a television set: particles accelerated at the cathode 
(during a substorm in the plasma sheet in the magnetotail) are deflected by a 
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Fig. 8.43. Chart of isochasms, i.e. lines of constant aurora frequency. Reprinted 
from H. Fritz [173], Das Polarlicht^ Copyright 1881, with kind permission from 
Spektrum Akademischer Verlag 



field (the geomagnetic field) and emit light on hitting a fiuorescent screen (the 
upper atmosphere). Early last century, long before the development of the 
TV, Birkeland [50] built the terrella, a model that allowed simulation of the 
auroral distribution in the laboratory. It completely illustrates this analogy: a 
cathode-ray tube in a vacuum chamber emits electrons towards a sphere, the 
terrella, covered with a fiuorescent material. If an electromagnet inside the 
terrella is switched on, the terrella emits light only from two circles around 
its poles, while with the electromagnet turned off, light is emitted from all 
over the hemisphere viewing the cathode-ray tube. 

8.6.4 Electron Acceleration 

Although this analogy is vivid, it is incomplete. The basic difference between 
the aurora and a TV set lies in the motion of the particles just before the 
screen. As the particles are accelerated in the plasma sheet, they propagate 
towards the Earth, are deflected by the magnetic field from equatorial regions 
towards higher latitudes, and then have to penetrate into the atmosphere to 
excite the atoms and molecules which in turn emit the light seen as aurora. 
The fundamental problem in this chain of events is the propagation of the 
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particles deep into the atmosphere. Moving closer towards the Earth, the 
particles find themselves in a converging magnetic field. The constancy of the 
magnetic moment then leads to a refiection back towards the magnetosphere. 
This refiection occurs at heights of about 1000 km, where densities are too 
small for recognizable light emission. Particles could penetrate deeper into the 
atmosphere if they were accelerated during their motion. Although the details 
of the acceleration are still under debate, the spectra of electrons measured by 
rockets at heights of some hundreds of kilometers clearly give evidence for it: 
outside of auroral arcs, the spectrum is roughly a power law up to energies 
of at least 10 keV. Over an auroral arc, however, there is a pronounced 
peak superimposed on this power law, exceeding the power law intensities by 
up to two orders of magnitude at energies of a few kiloelectronvolts. Thus 
the additional electron component is nearly monoenergetic which strongly 
suggests acceleration in an electric field. 

Today it is assumed that at heights of some thousands of kilometers a 
potential structure develops along the field line with a higher positive poten- 
tial at lower altitudes. Although such a structure would explain the observed 
electron spectra, an explanation of the structure itself is difficult. Since the 
plasma is collisionless, according to (8.25) the conductivity g \\ parallel to the 
magnetic field is infinite and the magnetic field lines are equipotential lines. 
Thus parallel electric fields are canceled immediately. 

Different processes for the development of the potential structure are 
discussed [52,190,209]. All mechanisms agree that the fundamental driving 
mechanism is an increase in the magnetospheric convection due to changes 
in solar wind properties. One of the mechanisms under discussion is the de- 
velopment of double layers. A double layer forms when currents are flowing 
between plasmas with different properties. In geospace, these are the cold and 
dense ionospheric plasma and the hot and rarefied magnetospheric plasma. 
For a double layer to form and to be stable, a field-parallel current must 
already flow. In the high-latitude ionosphere this would be the Birkeland 
current, connecting the ionosphere with the magnetosphere. While it is diffi- 
cult to describe the formation of a double layer, we can at least explain how 
it can stay stable. If the double layer has formed, a potential drop exists. 
The particle populations encountering this drop can be divided into elec- 
trons and ions, ionospheric and magnetospheric plasma, and thermal and 
suprathermal particles. Let us start with the ionospheric population. This is 
a thermal population, consisting of ions and electrons. Ions moving upward 
from the ionosphere towards the double layer see a decreasing potential and 
thus are accelerated, leaving the ionosphere through the double layer. Iono- 
spheric electrons, on the other hand, are reflected back into the ionosphere. 
For the magnetospheric plasma, the situation is just the opposite: flowing 
towards the Earth it encounters a positive potential drop, leading to an ac- 
celeration of electrons towards the ionosphere while the ions are deflected. 
In addition, electrons are created inside the double layer by the interaction 
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between the accelerated magnetospheric ions and neutrals. These electrons 
are accelerated downwards, too. The net effect is an acceleration of the elec- 
trons without destroying the potential drop. Note that this works only if the 
flow speed of the magnetospheric electrons is larger than their thermal speed, 
that is a current already flows. This is the upward Birkeland current shown in 
Fig. 8.23. This is in agreement with the observation of aurorae being limited 
to regions where the Birkelands current flow upwards, i.e. where electrons 
move from the magnetosphere to the ionosphere. 

A modification of the double layer concept is the electrostatic shock. This 
is a double layer which is not stationary like the one discussed above but 
moves with the average ion speed along the magnetic field line. 

Observations suggest that a different mechanism also is important in the 
electron acceleration: the pitch angle distributions of electrons and ions above 
auroral arcs are different. Thus both populations are reflected at different 
positions in the converging magnetic field. Since the particles stay at the 
reflection point for rather a long time (their velocity parallel to the field 
vanishes), a charge separation and thus an electric field results which in turn 
accelerates the lighter species, the electrons. 

8.6.5 Excitation of the Atmosphere 

On hitting the denser atmosphere, the electrons cause electromagnetic emis- 
sions due to excitation (M + e~ M* + e~) or excitation and ionization 
(M + e — -f 2e~) of the neutrals. Here M denotes an atmospheric con- 
stituent, such as N, N 2 , O, and O 2 . Auroral lines are emitted in the entire 
range from UV to IR. The most intense lines in the visible are the green 
oxygen line at 557.7 nm and the red double line of oxygen at 630 nm and 
636.4 nm. Both are forbidden lines, thus early observers were not able to 
identify them. Another intense line is emitted by the nitrogen molecule at 
427 nm, a weaker one at 470 nm. Both lines result from transitions between 
different vibrational states and can be observed only at the lower edge of the 
aurora since nitrogen molecules are rare above 120 km. Table 8.1 summarizes 
the most important auroral lines and the heights where they are emitted. 

Not only electrons but also protons can excite the neutral atmosphere. 
Then the excitation is due to charge exchange: the proton is decelerated and 
becomes an excited hydrogen atom. This, in turn, emits either the La line 
in UV or the Ha line in the red. Thus proton aurorae are always red. In 
addition, they are less structured than electron aurorae, cover larger areas, 
and are observed only at quite high altitudes, i.e. between 300 km and 500 km. 
Proton and electron aurorae can occur simultaneously. A special example of 
a proton aurora is the polar glow that forms when solar protons penetrate 
into the day side magnetosphere along the cusps. 
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Table 8.1. Frequent lines in the auroral emission. The Ha line is observed in proton 
aurorae only 



Wavelength Emitting Altitude Visual 

(nm) species (km) color 



391.4 


N+ 


1000 


violet-purple 


427.8 


N+ 


1000 


violet-purple 


557.7 


0 


90-150 


green 


630.0 


0 


>150 


red 


636.4 


0 


>150 


red 


656.3 


Ha 


200-600 


red 


661.1 


N2 


65-90 


red 


669.6 


N2 


65-90 


red 


676.8 


N2 


65-90 


red 


686.1 


N2 


65-90 


red 



8.6.6 Shape and Local Time 

But the aurora is not only limited to the cusps and the nightside. Satellite 
observations indicate that often a closed auroral oval can be observed and 
observations from the ground show that auroral activity is not only limited 
to a few hours around local midnight but can be observed during the entire 
night (and during the polar night even at day-time). In this case a variation 
of the aurora with local time can be observed, as shown in Fig. 8.44. In 
the shaded area between noon and midnight, diffuse aurora form or quiet 
and stable arcs. After about 20 LT, these arcs become more wavy, forming 

12 



Fig. 8.44. Shape of the 
aurora in dependence on 
local time. Reprinted from 
K. Schlegel [460], in Plas- 
maphysik im Sonnensy- 
stem (eds. K.-H. Glass- 
meier and M. Scholer), 
Copyright 1992, with kind 
permission from Spektrum 
Akademischer Verlag 
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complex patterns of bands which particularly during a substorm move across 
the sky in a westward travelling surge. During the decay phase of a substorm, 
these bands resolve into isolated patches travelling eastward. These patches 
frequently are observed during the morning hours. The small arcs on the 
dayside at latitudes of about 75° are formed when solar particles or the solar 
wind penetrate into the polar cusps. 

A special case is the theta aurora. It occasionally can be observed from 
high-flying satellites as a closed auroral oval supplemented by an arc extend- 
ing across the polar cap from the dayside to the night side, giving the aurora 
the shape of the greek letter 0. It is observed only at times of a northward 
interplanetary magnetic held, i.e. at times of a closed magnetosphere. The ex- 
istence of this arc is difficult to understand since the field lines from the polar 
cap connect back to the lobes where the plasma density is very low. Current 
interpretations involve boundaries along the Sun-Earth line with Pederson 
currents converging over the caps at this boundary and then flowing upwards. 



8.7 Energetic Particles in the Magnetosphere 

Particle populations in the magnetosphere have different sources and prop- 
erties. A simple distinction is based on rigidity. Particles with high rigidity 
are able to traverse the magnetosphere, and thus no long-lived trapped par- 
ticle components with high rigidities exists. High-rigidity particles coming 
from the outside, such as galactic cosmic rays, depending on their direction 
of incidence, penetrate deep into the magnetosphere and interact with the 
upper atmosphere or are deflected back into space. For low-rigidity particles, 
three cases can be distinguished, see the right-hand side in Fig. 8.45. Parti- 
cles hitting the low-latitude magnetosphere from the outside perform half a 
gyro-orbit inside the magnetosphere and then are reflected back into space. 
Only at the polar cusps can these particles penetrate into the magnetosphere, 
and on interaction with the atmosphere produce polar cap absorption (PC A) 
events. The third low-rigidity particle component is different from all particle 
populations discussed above in so far as it is not a transient but a long-lived 




Fig. 8.45. Orbits of particles 
with low (right) and high (left) 
magnetic rigidity 
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component: particles are trapped inside the radiation belts. Their motion is 
regulated by the adiabatic invariants; nonetheless, radiation belts are not a 
static phenomenon but a dynamic one with sources and losses depending on 
the other particle populations and on geomagnetic activity. 

8.7.1 The Radiation Belts 

The discovery of the radiation belts was the first significant scientific result of 
space research with satellites. The first observations were made with a Geiger 
counter on board Explorer 1, launched on 31 January 1958. The discovery 
of the radiation belt was accidental. The Geiger counter had been designed 
by a group around J. van Allen from the University of Iowa to measure 
cosmic rays in the high atmosphere and had been adjusted to accommodate 
the expected fluxes. During short parts of the satellite orbit, the observations 
met the expectations; for long times, however, the observed fluxes either were 
much too large or way too small, raising doubts about the performance of the 
instrument. But from the pattern of expected and unexpected counting rates, 
it became evident that the unexpected signals always indicated particle fluxes 
much higher than expected, with the low counting rates being a saturation 
effect. Subsequent measurements with Explorer 3 and Sputnik 3, both in 
1958, confirmed the existence of the radiation belts. 

First Observations. The first measurements did not identify the particle 
species or energy. Electrons with energies between 50 keV and 150 keV were 
expected to be responsible for most of the counts. A strong dependence of 
counting rate on height was observed. Observations with Pioneer 3 in a highly 
elliptical orbit with an apogeum at 107 400 km suggested a double structure 
with an inner radiation belt starting at about 400 km with a maximum at 
about 1.5 te- The counting rates then decreased, but a second radiation belt 
was found between 3 te and 4 te with a maximum at about 3.5 te- The 
depleted region between these two radiation belts was called the slot. 

Subsequent measurements changed this rather simple picture. First, it 
was discovered that the trapped particles not only are electrons in the tens 
and hundreds of kiloelectronvolt range but also protons with energies up to 
more than 30 MeV. And second, the two distinct radiation belts are fictitious. 

Properties and Orbits of Radiation Belt Particles. Figure 8.46 gives a 
more detailed description of the radiation belts. The upper panel is concerned 
with protons, the lower ones with electrons. In the upper panel, solid lines 
give the distribution of protons with energies greater than 30 MeV, and the 
dotted lines represent protons with energies between 0.1 MeV and 5 MeV. 
The high energetic protons dominate the inner part of the radiation belt: its 
lower edge is at about 1.15 te- At lower altitudes the losses of particles due 
to interaction with the atmosphere are too large to allow for a stable trapped 
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Fig. 8.46. Distribution of particles inside the radiation belts. In the upper panel, 
the solid lines give the distribution of protons with energies above 30 MeV, and the 
dotted lines represent protons with energies between 0.1 MeV and 5 MeV. In the 
lower panel, the solid lines give electrons with energies above 1.6 MeV, while the 
dotted lines correspond to energies between 0.04 MeV and 1 MeV. Reprinted from 
W. Kertz [284], Einfuhrung in die Geophysik^ Copyright 1971, with kind permission 
from Spektrum Akademischer Verlag 



population. The maximum of the high energetic protons is at about 1.5 te; 
with increasing height the density decreases. The picture is different for the 
low energetic proton component. Here the fluxes are much higher and a broad 
maximum can be found around 3.5 te- 

If we consider the electrons as shown in the lower panel in Fig. 8.46, the 
pattern is slightly different. Low-energy electrons (dotted line, 0.04-1 MeV) 
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and high-energy electrons (solid lines, energy above 1.6 MeV) have their 
maxima at similar positions, i.e. about 3.5 te- Nonetheless, the high-energy 
component is confined to a smaller spatial region. 

Note that in Fig. 8.46 the radiation belts are given in geomagnetic coordi- 
nates under the assumption of axisymmetry around the dipole axis. Since the 
dipole axis is offset with respect to the center of Earth, in spatial coordinates 
the radiation belts are asymmetric. In particular at the SAA off the coast of 
Brazil, the radiation belts can be found in rather low altitudes. 

Although the upper panel of Fig. 8.46 still suggests a description in terms 
of two separate radiation belts, consideration of the intermediate energies 
not shown in the figure suggests a different picture. Today, we understand 
the radiation belt as a zone of trapped particles with the properties of the 
particles changing continuously with distance. The higher energies can be 
found predominately close to the Earth, thus in the inner part of the radiation 
belt the energy spectrum is rather hard while it steepens with increasing 
distance. 

More recent observations, in particular by SAMPEX (Solar, Anomalous, 
and Magnetospheric Particle EXplorer, see e.g. sunland.gsfc.nasa.gov/ 
smex/sampex/ or surya.umd.edu/www/sampex.html), indicate the existence 
of a distinct trapped particle component inside the inner radiation belt. The 
particles of this new radiation belt differ from the other radiation belt parti- 
cles insofar as their composition and charge states closely resemble those of 
the anomalous component instead of the galactic cosmic rays. The fiux den- 
sities are about two orders of magnitude larger than those of the anomalous 
component outside the magnetosphere. The dynamics of the new radiation 
belt are similar to those of the van Allen belt. 

Nonetheless, the radiation belt often is divided into two zones, an inner 
one with L <2 and an outer one with L > 2. This distinction is not so much 
concerned with the properties of the particles as with stability: in the inner 
zone, the particle populations are very stable and long-lived, while the particle 
populations in the outer zone vary with solar and geomagnetic activity. 

The basics of the motion of the radiation belt particles under undisturbed 
conditions is described by the adiabatic invariants (see Sect. 2.4). Equation 
(2.83) is the mirror condition: it gives the smallest pitch angle which will be 
reflected in a certain mirror configuration. In the magnetosphere (or more 
generally in a dipole field), the field is weakest at the magnetic equator and 
increases towards the poles. Thus, we are interested in the smallest pitch 
angle of a particle to keep it confined in the radiation belt and prevent it 
from being lost due to interaction with the atmosphere. With (8.11) and 
(2.83) we obtain 



-^eq COS Ajn 

\/l + 3sin^ Am 



(8.33) 



where is the magnetic field at the mirror point and is the geomagnetic 
latitude of the mirror point. The bounce period between the two mirror points 
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then can be determined as the integral of the full motion 



Th 




(8.34) 



This equation can be solved numerically after inserting v\\ and yields for the 
bounce period 

LRw 

Tb » , _ ■ (3.7 - 1.6sinaeq) • (8.35) 

Note that the bounce period only weakly depends on the equatorial pitch 
angle: the value in the parentheses is between 2.1 and 3.7, i.e. a variation 
of less than a factor of 2 in bounce period for particles almost standing at 
the equator (pitch angle close to 90°) and particles travelling almost field- 
parallel. The time a particle stays in a stretch d^ of the field line is longest for 
large pitch angles while it is small for small pitch angles. Thus the particle 
spends most of the bounce period close to the mirror points - and this time 
is the same for particles with large and small equatorial pitch angles. 

The angular drift velocity can be obtained similarly. Again, the integral 
can be solved only numerically: 

(vd) ~ (0.35 + 0.15 sin aeq) . (8.36) 



Equation (8.36) gives the drift velocity averaged over one bounce motion. For 
tteq = 0 we obtain for the equatorial ring current 



, 3L2nl+ki„ 
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(8.37) 



where n is the particle density. From (8.36) we obtain the drift period 
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(8.38) 



The solar-wind generated F?-field (dawn-to-dusk field or equatorial traverse 
field) causes an E x B drift with a drift speed 



_ ^ _ EeqL^ 
Beq Be 



(8.39) 



This drift is in the sunward direction. The gradB-drift, on the other hand, 
gives a westward drift for positive ions and an eastward drift for electrons, 
thus both species drift into opposite directions in the dawn side. Close to the 
Earth, the magnetic drift prevails and a symmetrical ring current arises. At 
larger distances, the E x B drift dominates and only a partial ring current 
forms. 
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Fig. 8.47. Scales of particle motion in the Earth’s magnetosphere. Reprinted from 
M. Schulz and L.J. Lanzerotti [468], Particle diffusion in radiation belts, Copyright 
1974, Springer- Verlag 



The typical time scales can be found in Fig. 8.47: the particles gyrate 
around their guiding field line with periods of the order of 10"^ s. They 
bounce back and forth along their guiding center field line with periods of 
about 1 s (north-south oscillation), and they drift around the Earth with a 
period of about 500 s, forming a ring current. Detailed discussions of particles 
in the radiation belts can be found in [320,445,468,544]. 

The particles trapped inside the magnetosphere show a typical pitch angle 
distribution, the loss cone distribution: particles with small pitch angles are 
absent because during the north-south oscillation these particles are not mir- 
rored back at high altitudes but penetrate deep enough into the atmosphere 
to be lost by interaction with atmospheric constituents. 
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Gains and Losses. The radiation belts would be stable were it not for 
two processes: first, magnetic field fluctuations scatter particles during their 
north-south oscillation into the loss cone, and second, during geomagnetic 
activity, the entire structure of the magnetosphere is distorted. Thus the adi- 
abatic invariants can be violated and particles are fed into the loss cone. On 
the other hand, the radiation belts do exist, and thus there are sources re- 
plenishing them. Despite their structure and temporal variability, the average 
properties of the radiation belts are remarkably constant, thus an equilibrium 
between sources and sinks appears to exist. 

The first indications for the lifetime of radiation belt particles came from 
atmospheric A-bomb tests. In the Starfish experiment in 1962, an artificial 
electron population had been injected into the radiation belt. After only 
about 10 years, this population had vanished into the background. However, 
the lifetime of trapped particles is not a universal constant, but depends on al- 
titude and particle properties. The lifetime increases fast from the lower edge 
of the radiation belts to some years at an altitude of about 8000 km (1.25 te) 
and decreases to minutes at the outer edge of the radiation belts [533]. The 
exact variation depends on the external circumstances, in particular the in- 
fluence of solar and geomagnetic activity: under undisturbed conditions, the 
lifetime is larger than during strong solar or geomagnetic activity. 

Particle losses are always due to the interaction of radiation belt particles 
with the atmospheric gas. Significant losses occur only when the atmosphere 
is sufficiently dense to support interactions, that is below an altitude of about 
100 km. Thus, the losses happen at the mirror points where the particles 
come closest to the atmosphere. The chance of getting lost therefore is largest 
for particles with small pitch angles, and overall losses increase when pitch 
angle scattering increases. Under the simplifying assumption that interaction 
happens only at the surface, the equatorial loss cone can be described by 
(8.33). Using the T-shell parameter, the loss cone also can be written as 

sin afq = V4L6 - . (8.40) 

Thus, the loss cone becomes rather small for r > 3 Re which validates the 
above distinction into an inner and outer radiation belt. The physical mech- 
anisms for losses are: 

• Charge exchange with a particle from the neutral atmosphere: a fast ra- 
diation belt proton captures an electron from the atmospheric hydrogen, 
leaving behind a slow proton and continuing itself as a fast hydrogen atom 
through the atmosphere. This mechanism is of particular importance in the 
inner radiation belt for protons with energies below about 100 keV. With 
increasing energy, the interaction time between radiation belt protons and 
atmospheric hydrogen decreases, making the interaction less likely. 

• Nuclear collisions between protons and atmospheric atoms and molecules: 
these are important loss mechanism for protons with energies above 75 MeV 
in the inner radiation belts. 
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• Scattering into the loss cone is the main loss mechanism for electrons and 
protons in the outer radiation belt. Scattering can occur either by viola- 
tion of the second adiabatic invariant due to changes in the field at time 
scales shorter than the north-south oscillation time, or due to pitch angle 
scattering at electrostatic or electromagnetic waves; see Sect. 7.3.5. 

Sources. The sources of radiation belt particles can be divided into the 
“creation” of particles inside the radiation belts or the motion of particles 
into the radiation belts under violation of the second adiabatic invariant. 

The high energetic particles in the inner radiation belts in general are 
created there. The main mechanism is GRAND (Cosmic Ray Albedo Neu- 
tron Decay). Nuclei from the galactic cosmic radiation penetrate deep into 
the atmosphere and interact with the atmospheric gas. A 5 GeV proton, for 
instance, on average creates about seven neutrons during these interactions. 
Some of the neutrons are slowed down, creating the cosmogenic nuclides 
such as radiocarbon (capture of thermal neutrons by atmospheric nitrogen: 
^^N(n,p)^^C) or ^^Be (spallation of nitrogen or oxygen due to the capture of 
fast protons or neutrons). Other neutrons simply escape without interaction. 
Since neutrons are neutrals, their motion is not infiuenced by the geomag- 
netic field, thus some of them might propagate into the radiation belts. But 
neutrons are not stable and decay into a proton, an electron, and an antineu- 
trino. If this decay happens inside the radiation belt, electrons and protons 
are trapped, thus replenishing the radiation belt population. This process is 
called GRAND because the primaries are Cosmic Rays, creating secondaries 
which are partly refiected (Albedo = reflectivity) into the radiation belts, 
where the Neutrons Decay. 

Another source, also based on in situ creation, is the influx of particles 
from the outer magnetosphere. In the outer magnetosphere, particles with 
low energies dominate. Occasionally, these particles can recombine, forming 
neutrals. These neutrals are no longer guided by the magnetic field and even- 
tually propagate into the radiation belts. On the dayside, the neutrals are not 
stable but immediately become ionized by the Sun’s hard electromagnetic ra- 
diation. If this process happens while the neutral is inside the radiation belt, 
an additional electron and ion are fed into the belt population. 

In the outer radiation belt (L > 3), the sources and sinks become more 
complex and are much more intimately related to solar and geomagnetic 
activity. For instance, the fluxes of energetic electrons are increased during 
higher geomagnetic activity and relax only slowly after a strong geomag- 
netic storm. While the maximum of the radiation belt is depleted during the 
storm, at larger distances the flux densities are higher, slowly propagating 
towards smaller L-shells (see Fig. 8.48). Thus the radiation belt is refilled 
with particles from the outer magnetosphere. 

For a fixed distance in space, changes in particle flux can be observed as a 
consequence of changes in the solar wind speed and the related geomagnetic 
activity. In Fig. 8.49 the dashed line gives the variation of the solar wind 
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Fig. 8.48. Fluxes of trapped elec- 
trons with energies above 1.6 MeV 
for different times after a geomag- 
netic storm. The dashed curve (1) 
gives the undisturbed conditions on 
7 Dec. 1962. The other curves are 
taken after the storm on 20 Dec. 
(4), 23 Dec. (5), 29 Dec. (6), and 8 
Jan. (7). The motion of the electrons 
towards the inner magnetosphere is 
clearly visible. Based on [168] 



SOLAR ROTATION 1935 




Fig. 8.49. Flux of energetic electrons above 3.9 MeV in a geostationary orbit 
{solid line) compared with the solar wind speed {dashed line). The polarity pattern 
of the interplanetary magnetic field is shown at the bottom. Reprinted from G.A. 
Paulikas and J.B. Blake [402], in Quantitative modeling of magnetospheric processes 
(ed. W.P. Olson), Copyright 1979, American Geophysical Union 



speed, and the solid line the changes in electron fluxes in a geostationary 
orbit at a height of 36 000 km, well outside the maximum of the radiation 
belts. The electron flux increases in response to an increase in solar wind 
speed with an energy-dependent delay between about 1 and 2 days due to the 
inward propagation of the particles. Thus compression of the magnetosphere 
is connected with an increase in particle fluxes, although the latter is delayed 
depending on the energy of the particles and the position of the observer. 

L- Shell Diffusion. Particles refilling the radiation belts have entered the 
magnetosphere at the polar cusps, either from the outside as solar wind 
plasma or solar energetic particles, or from the ionosphere. Although some 
of these particles might have rather high energies on entering the magneto- 
sphere, most of them will have energies much closer to the plasma’s thermal 
energy than to the energies typically observed in the radiation belts. Thus not 
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only do we need a mechanism to transport these particles into the radiation 
belts but we also need to accelerate them. 

Both transport as well as particle acceleration is induced by magnetic 
fluctuations, as is evident from the observations described in connection with 
Figs. 8.48 and 8.49. The transport mechanism basically is diffusion, although 
convection and drift also contribute to the transport [478]. The acceleration 
is related to the transport because during the inward transport the magnetic 
flux density increases. The acceleration mechanisms are the betatron effect 
and Fermi acceleration. 

In the betatron effect a charged particle gyrates in a magnetic field 
with slowly increasing magnetic flux density. As the flux increases, the 
gyro- frequency increases, too (see (2.28)). Since the angular momentum 
mvri, = mv^ I Uc stays constant, the particle’s kinetic energy increases as 

Wkin = mv^ ! 2 ~ cjc ~ -B . (8-41) 

Thus a particle moving inwards in the magnetosphere gains energy as the 
magnetic field increases. 

Whereas the betatron effect accelerates particles moving perpendicular to 
the field towards regions of higher flux density, the Fermi effect accelerates 
the motion of a particle parallel to the field: according to the second adiabatic 
invariant (2.85), a particle gains energy as its travel path along the magnetic 
field line shortens. As a particle moves inwards in the magnetosphere, the 
field line along which it bounces shortens and the particle gains energy. 

The inward motion of the particles is a diffusive process based on the vio- 
lation of the third adiabatic invariant (flux invariant) when marked variations 
of the magnetosphere occur on time scales smaller than the drift period of 
the particles. Figure 8.50 gives a very simple model: assume a narrow particle 
distribution in the equatorial plane (a). A sudden variation in the solar wind 
leads to a fast compression (expansion) of the magnetosphere (b) . Since this 



(a) (b) (c) 




Fig. 8.50. Effect of an asymmetric sudden compression and slow relaxation of the 
geomagnetic field on a narrow band of equatorially trapped particles. After the 
recovery period, the particles fill the shaded band 
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is a fast change, the magnetic flux inside the particle’s drift orbit is suddenly 
increased (decreased). The held now relaxes slowly (adiabatically) back to its 
original state. As the flux is now conserved, the drift orbit expands (shrinks) 
and the particle orbit has moved in radius. Since the magnetosphere is asym- 
metric, a change in solar wind leads to a stronger variation in the held on 
the dayside than on the nightside. Thus the effect on the particle not only 
depends on the properties of the disturbance but also on the particle’s posi- 
tion in its drift orbit. Therefore the initial distribution is spread in radius (c). 
This process is diffusive, and it is called L-shell diffusion or radial diffusion. 

In sum, the adiabatic heating creates 10-100 keV ring current ions from 
1-10 keV plasma sheet ions. The adiabatic heating is divided into a transverse 
part 




and a longitudinal one 



Wo 




(8.43) 



where = 2 for aeq = 0 and k = 2.5 for aeq 90°: the adiabatic heating 
due to the Fermi effect decreases with increasing path between the mirror 
points. 

The process shown in Fig. 8.50 leads to a spread of the particle distri- 
bution but not necessarily to an inward motion. Since particle streaming in 
diffusion is driven by a gradient (see (7.10)), for undisturbed magnetospheric 
conditions, such as in the particle distributions in Fig. 8.46, we would not 
expect an inward streaming but rather one directed outwards. After a strong 
geomagnetic storm, however, the particle density increases in the outer mag- 
netosphere due to compression as well as to an injection of plasma from the 
tail, leading to a density gradient and therefore a particle streaming towards 
the inner magnetosphere as is evident in the observations shown in Fig. 8.48. 



8.7.2 Galactic Cosmic Rays - St0rmer Orbits 

Galactic cosmic rays have high rigidities and thus their gyro-radii are compa- 
rable to the size of the magnetosphere. Thus concepts such as the adiabatic 
invariants cannot be applied to the motion of these particles: instead, their 
equation of motion has to be integrated along the particle orbit. This was 
first done by C. Stprmer; the orbits thus are called Stprmer orbits. 

In interplanetary space, a galactic proton has a gyro-radius of the order 
of 1/100 AU. If such a proton hits the magnetosphere, it senses a strong 
change in the magnetic field on a spatial scale much smaller than its Larmor 
radius. Thus the simplifying assumption of a uniform magnetic held made for 
the radiation belt particles cannot be applied here. The fate of the particle 
then depends on its pitch angle and the direction and location of incidence: 
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to) Equatorial plane 



30® 30® 




b) Meridian plane 



North 30® 60® 




Fig. 8.51. Orbit of a 
high energetic particle 
in a magnetic dipole 
field in (a) the equa- 
torial plane and (b) 
in a meridional plane. 
Reprinted from J.K. 
Hargreaves [212] (bas- 
ed on [345] , The solar- 
terrestrial environ- 
ment^ Copyright 1992, 
with kind permis- 
sion from Cambridge 
University Press 



particles propagating nearly field-parallel into the polar cusps are less influ- 
enced by the geomagnetic field than particles hitting the magnetosphere in 
equatorial regions perpendicular to the magnetic field. 

The motion of such a high energetic particle is still determined by the 
Lorentz force; however, since the magnetic field varies strongly on a scale 
smaller than the gyro-radius, the equation of motion has to be integrated. 
An example is sketched in Fig. 8.51 where the orbit of a 1 GV particle coming 
from the east is shown in the equatorial plane (a) and the meridional plane 
(b) of a magnetic dipole field. The particle trajectory is irregular with wide 
excursions in latitude and longitude until it finally hits the Earth. 
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SUNSPOT NUMBER 





Fig. 8.52. Intensity of the galactic cosmic radiation observed with neutron monitors 
at different geomagnetic latitudes (from top to bottom): Huancayo (Re = 13 GV), 
Climax (3 GV), Moscow (2.4 GV), and Murmansk (0.6 GV). Reprinted from G.A. 
Bazilevskaya, Observations of variability in cosmic rays, Copyright 2000, with kind 
permission of Kluwer Academic Publishers 



The particle’s orbit depends on the location and direction of incidence. 
The most important result of Stprmer’s work has been the definition of 
allowed and forbidden regions on the ground which can be reached by a 
charged particle travelling towards the Earth. For instance, to hit the Earth 
at a certain magnetic latitude 0c, the particle’s rigidity must be at least 
Pc = 14.9 cos^ 0c 5 with Pc in units of 10^ V or 1 GV. This rigidity is also called 
the cutoflF rigidity: particles with a rigidity Rc can reach all latitudes of 0c and 
above, but particles with smaller rigidities can hit the ground only at higher 
latitudes. This latitude dependence can be seen, for instance, in the neutron- 
monitor counting rates measuring the galactic cosmic radiation. Figure 8.52 
shows in its top panel sunspot numbers, and in its bottom panel count- 
ing rates for neutron monitors with different geomagnetic cutoff rigidities 
Rc’. from top to bottom, the sites of these monitors are Huancayo/Haleakala 
{Rc = 13 GV), Climax {Rc = 3 GV), Moscow {Rc = 2.4 GV), and Murmansk 
{R^ = 0.6 GV). The modulation of galactic cosmic rays with the solar cycle 
affects only the lower energies; see Sect. 7.7 and Fig. 7.29. As a consequence, 
the modulation with the solar cycle almost vanishes at low-latitude stations 
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such as Huancayo/Haleakala, while it increases with increasing geomagnetic 
latitude and decreasing geomagnetic cutoff. 

The extent of the forbidden region is roughly the St0rmer unit Cst = 
y^ME/(-BrL) = For a 1 keV solar wind proton, the forbidden region 

extends for about lOOrE and it certainly will not reach the ground. For a solar 
wind electron, the forbidden region is even wider. 

If a proton of the galactic cosmic radiation has been detected on the 
ground, its origin can be determined by the reverse procedure: we let a nega- 
tively charged particle with the same rigidity propagate out through the geo- 
magnetic field towards infinity. Thus the direction of incidence of the galactic 
cosmic radiation can be determined. On Earth, the galactic cosmic radiation 
is isotropic, while during ground-level events solar protons with gigaelectron- 
volt energies can be detected coming predominantly along the interplanetary 
magnetic field line connecting the Earth to the Sun. 

8.7.3 Solar Energetic Particles - Polar Cap Absorption 

Except for ground-level events, the energies of solar energetic particles are 
too small to allow them to penetrate down to the ground. This is true for 
most of the magnetosphere. At the polar cusps, however, lower energetic so- 
lar particles also can reach the denser atmosphere. Their interaction with the 
atmosphere leads to increased ionization, which in turn leads to the absorp- 
tion of radio waves. Since this absorption is limited to the polar regions, this 
phenomenon is called polar cap absorption (PCA). Owing to the propaga- 
tion time between the Sun and the Earth, a PCA starts a few hours after the 
fiare. A similar effect on the day side ionosphere, the sudden ionospheric dis- 
turbance, on the other hand, starts immediately after flare onset because here 
the increased ionization is due to the flare’s hard electromagnetic radiation. 

The protons causing PCAs typically have energies between 1 MeV and 
100 MeV and can penetrate down to heights between about 30 km and 90 km. 
Thus the ion population created during a PCA lies below the ionosphere in 
an atmospheric layer which normally is neutral, allowing for ion-chemical 
interactions that normally do not occur at these heights (see Sect. 10.4.2). 

Figure 8.53 shows the areas typically affected by PCA. PCAs are limited 
to a small latitudinal ring around the geomagnetic pole. Outside this ring, 
PCAs only occur during very strong geomagnetic disturbances when the geo- 
magnetic cut offs are reduced, allowing particles to precipitate into the denser 
atmosphere at lower geomagnetic latitudes. 

Part of the energy transferred from the solar particles to the atmosphere 
leads to electromagnetic radiation in the visible range, called the polar glow 
aurora (PGA). In contrast to the normal aurora, which is highly variable and 
often well structured, the polar glow aurora is a diffuse red glow of the entire 
sky in high latitudes. Most often it is red since the excited atoms mainly are 
hydrogen, emitting in the visible the Ha line and in the UV range the La 
line. Even if its luminosity is well above the visibility threshold, it often is 
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Fig. 8.53. Areas affected by PCA: the regions inside the inner curves give the 
polar plateaus while the regions outside the outer curves usually are unaffected, 
except during severe geomagnetic disturbances. Reprinted from G.C. Reid [439], in 
Physics of the Sun, vol. Ill (eds. RA. Sturrock, T.E. Holzer, D.M. Mihalas, and K. 
Ulrich), Copyright 1986, with kind permission from Kluwer Academic Publisher 



difficult to detect because of its total lack of any contrasts. Note that the 
polar glow aurora is the only visible effect caused by solar energetic particles; 
the much more impressive aurora on the other hand is caused by plasma from 
the solar wind and the plasma sheet in the geomagnetic tail. 



8.8 What I Did Not Tell You 

So far we have discussed some features of the magnetosphere in rather simple 
terms which allow the application of concepts introduced in earlier sections. 
A complete description of the magnetospheric fields and plasmas, however, 
requires a self-consistent approach. Let us illustrate some neglected effects 
with the example of the magnetopause. For instance, we have described the 
magnetopause as a discontinuity in the magnetic field without adjusting the 
current systems accordingly. In addition, we have ignored the polarization 
field inside the current sheet, which results from the different Larmor radii 
of solar wind electrons and ions but also influences the motion of the very 
same particles, as is sketched in Fig. 8.9. A self-consistent approach, however, 
would have to consider quasi-neutrality and the screening of charges in the 
field of other charges. The characteristic length of the particle motion in the 
magnetopause would then be the inertial length c/uce of an electron instead of 
the gyro-radius. In the calculation of the magnetopause we have also ignored 
the magnetosheath, where the field and plasma properties are quite different 
from those in the solar wind, although both the field and the plasma are 
of solar wind origin. Putting all these effects together, the thickness of the 
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magnetopause would be some hundreds to about 1000 km, much more than 
the ion gyro-radius of about 100 km but much closer to the observations. 

All of these effects can be described analytically; the resulting set of 
equations, however, can be solved only by means of MHD simulations. In 
particular, the dynamic aspects, such as reconnection of planetary and in- 
terplanetary fields and the accompanying plasma exchange, can be incor- 
porated into such models. Overviews of magnetospheric modeling can be 
found in [260,543], and some articles can be found in [385]. The modeling 
of the magnetospheric plasma is discussed in [364], and models of the ra- 
diation belts can be found in [320]. Resources on the Web for modeling of 
the magnetosphere and/or the interaction between the solar wind and the 
magnetosphere include those of the Space Plasma Simulations Group at the 
University of California, Los Angeles, at www-spc.igpp.ucla.edu/, and the 
Space Plasma Physics Modeling Group at the University of Washington at 
WWW . geophys . Washington . edu/Space/SpaceModel/modelling . html. 



8.9 Summary 

The magnetosphere is shaped by the interaction between the solar wind and 
the terrestrial magnetic field. Since both are variable the magnetosphere 
shows a slow evolution as well as fast fluctuations. The basic features in the 
magnetospheric field topology are the magnetopause, the polar cusps, and 
the tail. The magnetopause marks the separation between the planetary and 
the interplanetary magnetic fields. In front of the magnetopause a bow shock 
develops where the solar wind is slowed down to subsonic speeds. Thus the 
region between bow shock and magnetopause, the magnetosheath, is filled by 
a hot but slow solar wind flow. 

Although the magnetopause separates the magnetic fields, the solar wind 
plasma can still penetrate through it, filling part of the outer magnetosphere. 
In the inner magnetosphere, the cold and dense ionospheric plasma fills the 
plasma sheet, as is evident from the high content. The region between this 
inner plasmasphere and the outer magnetosphere contains a rarefied plasma 
of mixed origin: under undisturbed conditions, the geosphere is dominated by 
the solar wind plasma while during geomagnetically active periods an outflow 
of ionospheric plasma is added. 

The particle populations in the magnetosphere have different sources and 
can be divided into stable and transient populations. The only stable compo- 
nent are the radiation belt particles, forming the ring current. Its variations 
lead to geomagnetic activity, detected on the ground as geomagnetic storms. 
The radiation belts are only a quasi-stable population since particles are lost 
due to interaction with the denser atmosphere and are created due to cosmic 
rays (GRAND) or the transport and acceleration of lower energetic parti- 
cles or solar wind plasma from the outer magnetosphere towards the Earth. 
Galactic cosmic rays have much higher rigidities and are not trapped inside 
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the geomagnetic field. Instead, depending on their direction of incidence, they 
can either penetrate down to the denser atmosphere or are defiected back into 
space. Particles with lower rigidities, such as solar energetic particles or the 
solar wind, can penetrate into the magnetosphere only at the polar cusps 
and, on interaction with the atmosphere, lead to polar cap absorption. 

The dynamic magnetosphere manifests itself in geomagnetic disturbances 
and aurorae. The energy source for these phenomena is the solar wind; the 
coupling between the solar wind and the magnetosphere is due to magneto- 
spheric convection. The level of geomagnetic activity is determined by the 
solar wind fiuctuations and the direction of the interplanetary magnetic field. 
If the latter has a southward component, the magnetosphere has an open 
configuration, more energy and plasma are fed into it, and its responses to 
solar wind variations are much stronger. The variations in the solar wind 
responsible for geomagnetic disturbances are fast solar wind streams, often 
leading to recurrent geomagnetic disturbances, as well as shocks and the 
ejecta driving them. While the recurrent disturbances can best be observed 
during the solar minimum when stable and recurrent fast and slow solar wind 
streams have developed, geomagnetic disturbances in response to transient 
phenomena are much more frequent during the solar maximum, since fiares, 
CMEs, and interplanetary shocks are more frequent at these times. 



Exercises and Problems 

8 . 1 . Determine the magnetic flux density and the direction of the field for 
your home town (assume a simple dipole field). Compare with Fig. 8.3. To 
which L-shell is your home town connected magnetically? Determine also the 
cut off rigidity and the Stprmer unit for this rigidity. 

8.2. A 10 keV proton with an equatorial pitch angle of 40° moves from L = 6 
to L = 1.5. Assume a dipole field and calculate the energy gained by Fermi 
acceleration (second adiabatic invariant) during this motion. 

8 . 3 . Assume a sinusoidal variation with period T = 1 h in the solar wind 
speed with an amplitude of ±40 km/s around an average of 400 km/s. De- 
termine the speed of the magnetopause and the maximum and minimum 
stand-off distances. 

8 . 4 . Stprmer orbits are calculated for a terrestrial dipole field. Give a quali- 
tative statement about the errors made in neglecting the actual shape of the 
magnetosphere. Try to consider the influence of the shifted dipole as well as 
the different topologies in the noon and midnight directions. 

8.5. Fig. 8.48 shows the inward L-shell diffusion of energetic electrons. Try 
to estimate the diffusion coefficient for this process (assume that the energy 
of the particles does not change during the inward motion). 
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8.6. The magnetopause is determined by the equilibrium between the gas- 
dynamic pressure of the solar wind and the magnetic pressure of the geo- 
magnetic field. The magnetic pressure of the interplanetary magnetic field is 
neglected, as is the gas-dynamic pressure of the plasmasphere. Determine the 
error due to this approximation. 

8.7. Give an order-of-magnitude estimate of the Chapman-Ferraro current. 




9 Planetary Magnetospheres 



Empty space is like a kingdom, and heaven and earth 
are no more than a single individual person in that 
kingdom ... How unreasonable it would be to suppose 
that besides the heaven and earth which we can see 
there are no other heavens and no other earths? 

Tang Mu, 13th century 



A magnetosphere is not a typical terrestrial phenomenon. Instead, magneto- 
spheres can be found around all magnetized bodies embedded in a plasma 
flow. Even around unmagnetized bodies (comets, planets without a magnetic 
fleld) a cavity is formed by the interplanetary magnetic fleld frozen into the 
deflected solar wind flow. In the solar system, all planets except Mars and 
Venus have a magnetosphere. Although they are different in size, for most 
of them the shape and the properties of the magnetosphere are similar to 
those in the terrestrial one. Special features are the large size and the flat 
inner structure of the Jovian magnetosphere and the oscillation of Neptune’s 
magnetosphere between pole-on and Earth-like. 



9.1 The Planets 

Table 9.1 lists the planets with their distance from the Sun, their sizes, and 
basic orbital parameters. The planets are divided into two groups, the inner, 
Earth-like planets Mercury, Venus, Earth, and Mars, and the outer, gaseous 
giants Jupiter, Saturn, Uranus, and Neptune. The Earth-like planets all have 
comparable radii with Mercury being the smallest, its radius is only about 
one-third of the Earth’s radius. Since they all have a solid crust and a heavy, 
iron-rich core, their densities are relatively high, ranging between 4 g/cm^ 
and 6 g/cm^. All Earth-like planets have atmospheres, although these are 
vastly different in density, composition, and temperature: Mercury has a very 
thin atmosphere which can hardly be recognized as an atmosphere, while 
the carbon-dioxide atmosphere on Venus is very thick, leading to a strong 
greenhouse effect with surface temperatures of about 750 K. The inner planets 
have few or no moons and no rings. 
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Table 9.1. Properties of the planets in the solar system 





Solar 

distance 

(AU) 


Sidereal 

period 


Spin Average 
period density 
(days) (gm/cm^ 


Surface 
gravity 
) (N/kg) 


Mercury 


0.39 


88 d 


56.8 


5.4 


3.6 


Venus 


0.72 


225 d 


243 


5.1 


8.7 


Earth 


1.00 


365 d 


1 


5.5 


9.8 


Mars 


1.52 


1 yr 322 d 


1.03 


4.0 


3.7 


Jupiter 


5.20 


11 yr 315 d 


0.41 


1.3 


26.0 


Saturn 


9.55 


29 yr 167 d 


0.44 


0.7 


11.2 


Uranus 


19.22 


84 yr 7 d 


0.72 


1.18 


9.4 


Neptune 


30.11 


164 yr 280 d 


0.67 


1.56 


15.0 



The outer planets, Jupiter, Saturn, Uranus, and Neptune, again are simi- 
lar. They are large, gaseous giants, mainly consisting of hydrogen and helium. 
Thus their average densities are low with values close to 1 g/cm^. Prom the 
outer atmosphere towards the center of the planet, the pressure increases up 
to more than a million times the surface pressure on Earth. Here the hydro- 
gen first becomes liquid and later metallic. It is still under debate whether 
the outer planets have solid cores. They have many moons and ring systems 
affecting the plasma and particle populations in the magnetospheres. 

The energetics of the outer planets are surprising: they radiate more en- 
ergy back into space than they receive from the Sun. Jupiter radiates back 
twice the energy received from the sun; on Saturn this ratio is about 3.5. 
The additional energy either is gravitational, released during the contraction 
of the planet, or a remnant from the creation of the solar system. Thus the 
large gaseous planets are more similar to the Sun than to the inner planets. 

The outermost planet, Pluto, cannot be fitted into this scheme: it appears 
to be a solid piece of rock and shows no similarities with the gaseous giants. 
Pluto’s orbit is unusual, too. It is highly eccentric and inclined with respect 
to the plane of the ecliptic. It is likely that Pluto was not formed together 
with the solar system but is a captured asteroid. 

A review of planetary magnetospheres can be found in [65], and the mag- 
netospheres of the outer planets are discussed in [452]. 



9.2 Planets with a Magnetic Field 

Aside from the outer gaseous giants, only Earth and Mercury have a magnetic 
moment strong enough to support a magnetosphere. We will now discuss the 
structures of the magnetospheres together with the plasma populations and 
their relationships to moons and rings. A comparison between the different 
magnetospheres follows in the subsequent section. 
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9.2.1 Mercury 

Mercury’s magnetosphere is rather simple because both the planet’s rotation 
axis as well as the magnetic field axis are roughly perpendicular to the plane 
of ecliptic. Since Mercury’s magnetosphere is very small with a stand-off 
distance of only 1.1 rp, it neither accommodates radiation belts nor a plasma 
population. 

Despite this simplicity, Mercury’s magnetosphere is very dynamic. Com- 
pared with the other planets, Mercury’s orbit is highly eccentric {e = 0.206) 
with a perihelion^ at 0.308 AU. In the perihelion, fiuctuations in the solar 
wind can create such a strong kinetic pressure that the typical magnetopause 
distance is smaller than the planet’s radius. Thus a magnetosphere in the 
sense of a planetary magnetic field enclosing the planet no longer exists, 
and the magnetic field is swept into the magnetotail. This variability of the 
magnetosphere does not support stable radiation belts. 

Nonetheless, Mercury’s magnetosphere contains a thermal plasma con- 
sisting mainly of H, He, O, Na, and K ions as well as electrons. The lighter 
species, H and He, are swept up out of the solar wind. The heavier ions are 
created on the dayside of the planet, as indicated in Fig. 9.1: solar UV radi- 
ation, and to a lesser extent also the solar wind, knock Na, K, and O out of 
the planet’s surface. These neutrals immediately become ionized by the solar 
UV radiation and then are kept in the planet’s magnetic field. 

The tail of Mercury’s magnetosphere even shows dynamic phenomena 
which closely resemble the substorms known from the terrestrial magneto- 
sphere. Because Mercury has neither an atmosphere nor an ionosphere, this 
observation indicates that the ionosphere is not a necessary condition for 
the occurrence of substorms. For the planetary consequences of a substorm, 
nonetheless, the existence of an ionosphere makes a difference. 




Fig. 9.1. Structure of Mercury’s mag- 
netosphere. The creation of a thermal 
plasma population due to incident so- 
lar UV radiation is indicated 



^ In an elliptical orbit the perihelion is the position closest to the Sun. Its coun- 
terpart is the aphelion. 
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9.2.2 Jupiter 

While Mercury’s magnetosphere is the smallest and simplest inside the solar 
system, the Jovian one is the largest and most complex magnetosphere. A 
very attentive and detailed description based on the Pioneer and Voyager 
fly-bys can be found in [131]; results obtained by Galileo are summarized 
in Science 272. In addition, the Jovian magnetosphere appears to be highly 
variable, as can be inferred from a comparison of the magnetic held data 
obtained by Voyager in 1979, Ulysses in 1992, and Galileo in 1995. 

The Jovian magnetic field can be approximated as a dipole with its axis 
inclined by 11° with respect to the planet’s rotation axis and slightly offset 
with respect to it. The magnetic moment of Jupiter exceeds that of Earth by 
four orders of magnitude; the magnetic field at the planet’s surface is about 
one order of magnitude larger. Since at Jupiter the solar wind intensity has 
decreased markedly, the magnetosphere extends far above the planet. The 
fast rotation of the planet with a period of slightly less than 10 h adds 
another feature to the Jovian magnetosphere: the centrifugal forces in the 
equatorial plane are very strong, resulting in an outward plasma flow. The 
planetary magnetic field frozen into this plasma is carried out, too, leading to 
the magnetic field configuration shown in Fig. 9.2. Therefore the inner Jovian 
magnetosphere is much flatter than the dipole-like inner magnetospheres of 
the other planets. 




Fig. 9.2. Jovian magnetosphere: the planet’s fast rotation leads to a concentration 
of plasma in the equatorial plane which carries out the magnetic field. Reprinted 
from J.A. Simpson and B.R. McKibben [486], in Jupiter (ed. T. Gehrels), Copyright 
1976, with kind permission from The University of Arizona Press 
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— > — Magnetic Field Lines 

Fig. 9.3. Plasma flows, currents, and magnetic fleld lines in the inner Jovian mag- 
netosphere. Based on F.M. Neubauer [374], in Plasmaphysik im Sonnensystem (eds. 
K.-H. Glassmeier and M. Scholer), Copyright 1991, reprinted with kind permission 
from Spektrum Akademischer Verlag 



This flattening of the magnetosphere is associated with a current system 
(see Fig. 9.3): the outward plasma flow is accompanied by an electric cur- 
rent directed radially outward. Without this current the plasma would not 
corotate with the planet but stay behind. The current system is closed by 
field-parallel currents through the ionospheric dynamo region. An additional 
current, separating the fields of opposite polarity, flows around Jupiter in the 
equatorial plane. Currents inside the Jovian magnetosphere can exceed lO^A. 

Within about 15 planetary radii, the dipole field is dominant. In this re- 
gion, the radiation belts are observed; see Figs. 9.12 and 9.13. The particles 
gyrating inside the radiation belts emit synchrotron radiation with frequen- 
cies up to some megahertz, making Jupiter a strong radio source. 

Some of Jupiter’s moons are inside the magnetosphere. Since the relative 
speed between the moons and the magnetospheric plasma is smaller than the 
Alfven speed, no bow shocks develop in front of the moons. They are unmag- 
netized bodies, and thus no magnetospheres develop around them. Therefore 
Jupiter’s magnetospheric plasma can interact freely with the moon’s surface. 
Since absorption exceeds the sputtering, the particle intensities in the radia- 
tion belts are reduced at the moon’s orbit; see Figs. 9.12 and 9.13. 

One moon, lo, is also a plasma and particle source: its volcanism injects 
particles, in particular S (see Table 9.2). This matter immediately becomes 
ionized and is accelerated to the speed of the corotating magnetosphere. Small 
differences in the initial conditions and in the interaction between charged 
particles and neutrals lead to the formation of a torus along the moon’s orbit 
instead of just a cloud of ionized gas around the moon (see Fig. 9.4). Charged 
particles flowing field-parallel out of this torus create Whistler waves and, on 
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Table 9.2. Sources of thermal plasma in the magnetospheres of the outer planets. 
The contributions of the planetary ionospheres and moons can be identified by their 
unusual composition, in particular the large amount of ions heavier than He 





Jupiter 
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Uranus 


Neptune 


Sources 


lo 

ionosphere 


ionosphere, 
icy moons, 
solar wind. 
Titan 


ionosphere, 

H-corona 


ionosphere, 
solar wind, 
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Composition 
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S, 0, H 2 O, N 


H, OH, H 2 , 0, 
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~ 3 X 10^® i/s 
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10 ^® i/s 


3 X 10^® i/s 
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months, years 


months 


days 
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Auroral Zone 




Fig. 9.4. Plasma torus around lo’s 
orbit and flux tubes connecting to 
the auroral regions. Because the 
planet’s magnetic axis is tilted with 
respect to the axis of rotation, lo 
weaves up and down within the 
plasma torus 



interaction with Jupiter’s ionosphere, a glowing aurora in the polar regions, 
lo’s plasma torus has a sharp inner boundary, while its outer boundary is less 
well-defined and extends outwards into the magnetosphere. 



9.2.3 Saturn 

Saturn’s magnetosphere is a rather simple one. It is axial symmetric because 
both the magnetic field axis and the axis of rotation are perpendicular to the 
plane of ecliptic. Since Saturn’s magnetic field is much weaker than Jupiter’s, 
the magnetosphere is smaller, more closely resembling the one at Earth. 
Plasma tori with different dominant species exist, as indicated in Fig. 9.5 
with particle densities well below the ones in the Jovian magnetosphere. 

Saturn’s magnetosphere can be divided into four distinct parts. The outer 
part is filled with a plasma corotating with the planet. The highest plasma 
density is observed at about 6 planetary radii, which is still deep inside the 
magnetosphere. The plasma mainly consists of N, O, and OH. Part of this 
plasma most likely originates in the ring system, and trapped particles in the 
sense of a radiation belt are not observed. 

Moving inwards, the radiation belts start with a slot region, extending 
from about 6.5 rp inwards to about 4 Vp. Here the particle fluxes are re- 
duced because of absorption by the moons Dione, Thetis, and Enceladus; 
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Fig. 9.5. Saturn’s magnetosphere with the hydrogen torus created by Titan, the 
oxygen torus of Thetis, and the positions of the inner moons and the ring system. 
Reprinted from D.A. Bryant [65], in Plasma Physics (ed. R. Bendy), Copyright 
1993, with kind permission from Cambridge University Press 



see Figs. 9.12 and 9.13. In the region between 4 and 3.1 planetary radii, i.e. 
between the orbit’s of Enceladus and Minas, the fluxes of radiation belt par- 
ticles increase again, as does their energy. Farther in, in the region occupied 
by the ring system, particle fluxes drop sharply. 

Titan, one of Saturn’s moons, has a relatively thick atmosphere, mainly 
consisting of methane and nitrogen. As Titan loses particles out of this at- 
mosphere, it creates a torus around its orbit with a width of about 2 Saturn 
radii inside the orbital plane and 7 to 23 planetary radii perpendicular to it. 
The interaction between Titan and Saturn’s magnetosphere is not as contin- 
uous as the one between the Jovian magnetosphere and lo because part of 
Titan’s orbit lies outside the magnetopause in the magnetosheath or even in 
interplanetary space. 

Saturn’s magnetosphere contains another plasma component which we 
have not encountered in the other magnetospheres: strongly ionized dust 
particles with sizes between a few microns and the size of a small moon. 
These dust particles modify the plasma properties. Surrounded by an electron 
cloud, they behave like the nuclei of large quasi-atoms. 

9.2.4 Uranus 

Uranus is a special planet in the solar system in so far as its axis of rotation 
nearly parallels the plane of ecliptic. Thus one pole of the planet always is 
directed towards the Sun. For the other planets, the axis of rotation is more 
or less perpendicular to the ecliptical plane, corresponding to our concepts 
about the formation of the solar system. It is speculated that a collision 
between Uranus and a relatively large and massive body might have tipped 
over the planet’s axis of rotation. 

On the basis of our observations at Mercury, Earth, Jupiter, and Sat- 
urn, as well as our understanding of the magnetohydrodynamic dynamo, we 
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would expect the magnetic field axis to be approximately parallel to the axis 
of rotation. If this were true, a pole-on magnetosphere should result: the mag- 
netic field axis lies in the plane of ecliptic and the solar wind blows directly 
in the direction of the polar cusp. As a consequence, the magnetotail would 
be quite different: instead of one current sheet separating the northern and 
southern lobes, within each lobe a current sheet would result. A sketch of 
such a pole-on magnetosphere is shown in the right panel of Fig. 9.8. 

The Voyager encounter with Uranus, however, revealed a magnetosphere 
similar to the one at Earth (see Fig. 9.10). The magnetometer measurements 
on Voyager indicated an inclination of the dipole axis of 58.6° with respect 
to the rotation axis. In addition, the magnetic dipole is offset with respect 
to the center of the planet (see Fig. 9.6). Thus the magnetosphere is not 
axial-symmetric as expected for a pole-on magnetosphere but rotates around 
the planet’s rotation axis. 

Two interpretations for the large angle between the dipole axis and the 
axis of rotation are suggested. First, Uranus just might undergo a polarity 
reversal. Second, the tilt between the two axes might originate in an unusual 
inner structure of Uranus, in particular in locations where the planetary dy- 
namo operates. This second interpretation is plausible in so far as on Neptune, 
which has a similar inner structure, a similar tilt between the dipole and the 
rotation axes is observed, although Neptune’s axis of rotation is inclined with 
respect to the ecliptic plane by only 28.8°. 

Inside the magnetosphere, radiation belts exist; however, the particle 
fiuxes and energies are rather small; see Figs. 9.12 and 9.13. In particular, in 
the electron fiuxes the infiuence of the moon on the radiation belt populations 
is obvious: along the moon’s orbit, the particle fluxes are reduced. 

Compared with all other magnetospheres in the solar system, the mag- 
netosphere of Uranus seems to be the only closed one: because the magne- 
tospheric plasma does not contain any a-particles, the solar wind seems to 
be completely frozen-out. Plasma sources most likely are the ionosphere, ion- 
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ization of the vast hydrogen corona of the planet, and, to a lesser extent, 
sputtering from the moon’s surfaces. 

9.2.5 Neptune 

Neptune’s magnetic field and the resulting magnetosphere proved to be even 
more complex. Neptune’s dipole axis is inclined by 47° with respect to the 
axis of rotation, which in turn is inclined by 28.2° with respect to the plane of 
ecliptic (see Fig. 9.7). The dipole not only is eccentric but also shifted along its 
axis towards its south pole. During one rotation of the planet, the inclination 
of the dipole axis with respect to the plane of ecliptic varies between 90° — 
28.8° -47° = 14.2° and 90° + 28.8° -47° = 71.8°. Thus within 8 h, Neptune’s 
magnetosphere oscillates between nearly pole-on and Earth-like (see Fig. 9.8). 
These oscillation were observed during the Voyager fiy-by. 




Fig. 9.7. The magnetic dipole of Nep- 
tune is inclined by 47° with respect to 
the rotation axis of the planet and also 
shifted towards one of the poles 




Fig. 9.8. Neptune’s magnetosphere can be similar to the terrestrial one {left panel) 
while half a rotation later it is a pole-on magnetosphere 
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Within the orbit of Triton, a radiation belt exists in Neptune’s magneto- 
sphere with fluxes much lower than those observed at Uranus. Plasma sources 
are the solar wind, the atmosphere, sputtering from the moons, and heavier 
ions from Triton. The plasma density in the outer magnetosphere is too small 
to distort the held signiflcantly. Neptune’s magnetosphere appears to be the 
least active in the solar system, being only weakly influenced by the variable 
solar wind. 

9.3 Planets Without a Magnetic Field 

The interaction of the solar wind with unmagnetized planets is studied best 
for Venus; our knowledge about the Martian magnetosphere at best is poor. 
Venus interacts with the solar wind in the same way comets do. 

The dayside of Venus’s “magnetosphere” is shown in Fig. 9.9. As the 
solar wind is slowed down to subsonic speed, a bow shock develops. In the 
magnetosheath behind the bow shock, the plasma and the interplanetary held 
are compressed. The magnetosheath is separated from Venus’s ionosphere by 
the ionopause. Analogous to the magnetopause, the ionopause is deflned as 
an equilibrium between the kinetic plasma pressure and the magnetic fleld 
pressure, only this time it is the planetary plasma and the combined pressure 
of the solar wind and the interplanetary magnetic fleld. The latter is frozen 
out of the ionospheric plasma. Although Venus has a magnetic fleld, it is 
too weak to support a magnetosphere in the classical sense: the subsolar 
magnetopause would lie inside the planet. 



Fig. 9.9. The interaction be- 
tween the solar wind and the 
dayside of Venus leads to an 
ionopause and a bow shock. 
Reprinted from T. Encrenaz 
and J.J. Bibring [146], The 
solar system, Copyright 1990, 
Springer Verlag 




Magnetosheath 
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The solar wind deflected around the planet forms a cavity which is nearly 
void of any plasma. Since the magnetic field lines forming the magnetotail 
are of opposite polarity, a current system similar to the one observed in the 
geomagnetic tail evolves. Depending on the solar wind conditions, strong 
magnetic fields can be observed at a height of a few hundred kilometers 
above the planet’s surface. These fields, however, are not of planetary origin 
but stem from the compressed and deflected interplanetary field. 



9.4 Comparison of Planetary Magnetospheres 

Although the physical mechanisms shaping a magnetosphere are the same for 
all planets, the properties of these magnetospheres can be different, depending 
on the magnetic moment of the planet, its rotation period, plasma sources 
and sinks, and the local solar wind properties. In this section we shall give a 
comparison of the magnetospheres discussed above. 

9.4.1 Structures of Planetary Magnetospheres 

In front of all magnetospheres a bow shock develops where the supersonic so- 
lar wind is slowed down to subsonic speeds. Behind the bow shock, structures 
are different. The two neighbors of Earth, Mars and Venus, both have only 
weak magnetic fields (see Table 9.3). Thus no well-structured magnetosphere 
develops, instead an ionopause/magnetopause separates the interplanetary 
and planetary plasmas. This does not result from an interaction between the 
solar wind and the planetary magnetic field but between the solar wind and 



Table 9.3. Parameters of planetary magnetic fields and magnetospheres. The col- 
umn ‘angle between axes’ gives the angle between the planet’s magnetic field axis 
and its rotation axis. In the column ‘plasma sources’ the ‘W’ indicates solar wind, 
‘A’ indicates planetary atmospheres and ionospheres, and ‘S’ satellites 
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the planetary ionosphere. The interplanetary magnetic field then is frozen-out 
of the planet’s ionospheric plasma. 

The most common type of magnetosphere is Earth-like. It also can be 
found at Mercury, Saturn, Uranus, and, at least for part of its rotation, 
around Neptune. The complexity of these magnetospheres depends on the 
angles of the planetary dipole and the axis of rotation with respect to each 
other and to the solar wind fiow. The sizes of the Earth-like magnetospheres 
are different, depending on the planet’s magnetic moment. 

The most complex magnetosphere can be found on Jupiter: although 
the axis of rotation and the magnetic field axis are parallel to each other 
and nearly perpendicular to the plane of the ecliptic, the magnetosphere is 
far more complex than the Earth-like ones because centrifugal forces from 
Jupiter’s fast rotation stretch the magnetosphere outward in the equatorial 
plane. 

The parameters of the planetary magnetospheres, such as the magnetic 
moment, the angle between the magnetic field axis and the axis of rotation, 
the typical distance of the magnetopause in planetary radii Vp as observed and 
as calculated from the pressure balance, as well as plasma sources are listed 
in Table 9.3 with ‘W’ for the solar wind, ‘A’ for planetary atmospheres and 
ionospheres, and ‘S’ for the planetary satellites. Note the relation between the 
rotation period and the dipole moment: with decreasing rotation period, the 
dipole moment increases because the MHD dynamo works more efficiently. 
Mars is an exception to this rule: most likely, the planet has cooled down 
too much to allow for a liquid core. Thus the dynamo process does not work. 
Nonetheless, the visit of Mars Global Surveyor to Mars revealed a magnetic 
field close to the planet about an order of magnitude larger than expected, 
and in size comparable to Mercury’s field. But this field does not result from 
a planetary dynamo, but rather appears to originate in ferromagnetic rocks 
in the planet’s crust [1]. 



9.4.2 Sizes 

Jupiter not only has the most complex magnetosphere but also has the largest 
one. This is true not only if the subsolar distance of the magnetopause is 
expressed in units of the planet’s radius but, since Jupiter is the largest 
planet, also in absolute numbers. The Jovian magnetosphere is also the largest 
structure in the solar system, even dwarfing the Sun in size; see Fig. 9.10. 

One parameter that defines the size of a magnetosphere is the stand-off 
distance expressed in units of the planetary radius Vp. This “typical magneto- 
pause distance” is given twice in Table 9.3: first as the observed value and 
second as calculated from the balance between the magnetic pressure of the 
planetary field and the kinetic pressure of the solar wind. Both values agree 
quite well, and the variability of the observed magnetopause distances refiects 
the high variability in solar wind pressure. 
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Fig. 9.10. Relative sizes of the planetary magnetospheres. Reprinted from C.T. 
Russell and R.J. Walker [452], Introduction to Space Physics^ (eds. M.G. Kivelson 
and C.T. Russell), Copyright 1995, with kind permission of Cambridge University 
Press 



The typical magnetopause distances order with the magnetic moment of 
the planet: the inner planets all have very small magnetospheres, extending 
less than one planetary radius above the planet’s surface. The only exception 
is Earth with its relatively high magnetic moment. The outer planets have 
even higher magnetic moments, corresponding to larger stand-off distances. 
Although at the outer planets the stand-off distances expressed in planetary 
radii are not much different from the Earth’s, these magnetospheres are much 
larger because the planets themselves are larger; see Fig. 9.10. 



9.4.3 Plasma Sources 

A magnetosphere often is called a magnetic cavity: the solar wind plasma 
cannot penetrate into this cavity. Nonetheless, as we have already seen in 
Chap. 8, the cavity is not void of any plasmas or particle populations. Plasma 
sources in the Earth’s magnetosphere are the solar wind and the terrestrial 
atmospheres and ionospheres, see the last column in Table 9.3. The same 
sources are available in Venus’s magnetosphere. Mercury, on the other hand, 
which is too small and too close to the Sun to support its own atmosphere, 
picks up all the magnet ospheric plasma from the solar wind. 
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Going farther out to the gaseous giants, we find another plasma source: for 
these planets at least some of the moons and the ring systems are inside the 
magnetosphere. These plasma sources easily can be identified by their unusual 
composition: in Chap. 8 we mentioned that the relatively high contribution 
of points to a partly atmospheric origin of the terrestrial magnetospheric 
plasma. Moons and rings can contribute to the magnetospheric plasma popu- 
lations by even more complex mechanisms, occasionally leading also to more 
exotic species, (a) On hitting the solid surface of a moon, energetic ions can 
knock out neutral atoms. These neutrals become ionized by charge-exchange 
with the surrounding plasma. This process, called sputtering, appears to be 
relevant in the inner magnetosphere of Saturn, (b) Interactions between the 
dust of the planetary ring system and the surrounding magnetospheric plasma 
might create an additional plasma component. This process also is discussed 
as a likely explanation of the spokes in Saturn’s rings, (c) Vulcanism on a 
planetary moon directly injects particles into the magnetosphere. A promi- 
nent example is lo, a small moon in the Jovian system which is kneaded 
so thoroughly by Jupiter’s strong gravitational pull that its core is molten, 
allowing for volcanism. lo mainly injects S'^, S^"^, and 0+ into the Jovian 
magnetosphere. In time, a plasma torus has formed along lo’s orbit, consist- 
ing of these ions, (d) The atmospheres of moons also directly inject particles 
into planetary magnetospheres. In the solar system, two moons with atmo- 
sphere are known: Titan, with an atmosphere basically consisting of nitrogen 
and methane with methane being available in all three phases, compara- 
ble with the water on Earth, orbits around Saturn, and Triton, with a thin 
atmosphere of nitrogen and methane, revolves around Neptune. Along their 
orbits, these moons have also created plasma tori; however, the intensities are 
much smaller than the ones observed along lo’s orbit, (e) And finally, moons 
also provide a loss mechanism for the magnetospheric plasma: on hitting the 
moon’s surface, a particle might be absorbed. 



9.4.4 Upstream of the Bow Shock: The Foreshocks 

Since all planets in the solar system have a bow shock, the shock enthusi- 
ast might wonder whether all these shocks accelerate particles. If they do, 
we would expect to observe a foreshock with plasma turbulence and ener- 
getic particles similar to the one in front of the terrestrial bow shock (see 
Sect. 7.6.4). And indeed, a foreshock can be found in front of all planets. But 
like the magnetospheres themselves, these foreshocks are variable. Their prop- 
erties depend on the angle between the interplanetary magnetic field and the 
bow shock. Since the curvature of the Archimedian field line increases with 
increasing radial distance, the angle between the field line and the bow shock 
changes, too. With increasing radial distances other parameters also change, 
such as the strength of the interplanetary magnetic field and the temperature 
and kinetic pressure of the solar wind plasma. 
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Fig. 9.11. Frequencies of 
waves upstream of plane- 
tary magnetosphere in de- 
pendence of the interplane- 
tary magnetic field strength. 
Reprinted from C.T. Russell 
et al. [454], J. Geophys. Res. 
95, Copyright 1990, Ameri- 
can Geophysical Union 



Despite these vastly different local conditions, all planets have foreshocks. 
Although they differ from the terrestrial foreshock, they nonetheless show 
the same general features, such as a resonance between waves and particles 
and, depending on 0Bn? regions with different particle distributions, reflecting 
the underlying acceleration mechanisms. Except for the Jovian electrons, the 
accelerated particles have rather low energies in the kiloelectronvolt to tens 
of kiloelectronvolt range. Depending on the planet’s distance from the Sun 
and the magnetic field strength at this distance, the frequencies of the waves 
resonating with the particles change: the lowest frequencies are observed at 
the outermost planets while the foreshock of the innermost planet shows the 
highest frequencies (see Fig. 9.11). 



9.4.5 Radiation Belts 

Except for Jupiter, all the outer planets have radiation belts that closely 
resemble the terrestrial one. Radial diffusion transports particles perpendic- 
ular to the magnetic field and pitch-angle scattering scatters particles into 
the loss cone from where they can penetrate deep into the atmosphere. The 
Jovian magnetosphere is different: lo lies deep inside the magnetosphere and 
is a significant source of matter and energy. The resulting radiation belt is 
humangeous compared with the typical radiation belts of Earth or the other 
outer planets. It is not only the large spatial extent, but also the high energies 
and fluxes of the particles which make the Jovian radiation belts a hazard 
even for unmanned spacecraft. 
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Fig. 9.12. Fluxes of energetic electrons in the magnetospheres of Earth, Jupiter, 
Saturn, and Uranus. Reprinted from C.T. Russell and R.J. Walker [452], in Intro- 
duction to Space Physics (eds. M.G. Kivelson and C.T. Russell), Copyright 1995, 
with kind permission from Cambridge University Press 



Figures 9.12 and 9.13 show cuts through the radiation belts of electrons 
and protons for Earth, Jupiter, Saturn, and Uranus. Neptune’s radiation is 
not shown; it is very similar to the one of Uranus, although the particle fluxes 
are significantly lower. The radiation belts of the different planets are quite 
similar: the highest fluxes always are observed just above the planetary at- 
mosphere. Saturn is an exception to this rule in so far as the maximum of the 
radiation belts is just outside the ring system. In all magnetospheres particles 
with high energies are limited to the inner radiation belts while lower energy 
particles can also be observed farther out. Thus the energy spectrum is hard- 
est close to the planetary atmosphere; with increasing radial distances the 
spectrum steepens. Particle fluxes and energies, on the other hand, are quite 
different. For instance, the flux of >3 MeV electrons in the Jovian magneto- 
sphere exceeds the one in the Earth’s atmosphere by 3 orders of magnitude, 
which in turn is still one order of magnitude higher than in the magnetosphere 
of Uranus. In addition, the electron energies in the Jovian magnetosphere are 
much higher than in the other magnetospheres. The differences in energies 
and fluxes of protons are even more pronounced. 
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Fig. 9.13. Fluxes of energetic protons in the magnetospheres of Earth, Jupiter, 
Saturn, and Uranus. Reprinted from C.T. Russell and R.J. Walker [452], in Intro- 
duction to space physics (eds. M.G. Kivelson and C.T. Russell), Copyright 1995, 
with kind permission from Cambridge University Press 



Magnetospheric size cannot be the parameter responsible for the differ- 
ent fluxes and energies because in the rather small terrestrial magnetosphere 
higher fluxes and more energetic particles can be observed than on Saturn, 
Uranus, and Neptune. The differences more likely result from the interaction 
between the magnetosphere and the solar wind as the ultimate source of en- 
ergy. In particular, differences in the reconnection rates at the magnetopause 
might feed more energy into the Earth and Jupiter systems, which in turn 
leads to more efficient acceleration. Loss mechanisms for the accelerated par- 
ticles, in particular pitch-angle scattering into the loss cone, might modify 
the particle distributions; however, they are not likely to explain the large 
differences in energy and fluxes. 
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9.5 Summary 

Almost all planets in the solar system, except Mars and Venus, have a magne- 
tosphere. The structures, sizes, and dynamics of these magnetospheres can be 

quite different. The most important parameters determining magnetospheric 

properties can be summarized as follows: 

• The planet’s magnetic moment combined with the local solar wind pressure 
determine the typical size of the magnetopause. If this is smaller than the 
planet’s radius, no classical magnetosphere is observed. 

• The structure of the magnetosphere is determined by the inclination of the 
dipole axis with respect to the axis of rotation and the plane of ecliptic, 
leading to Earth-like magnetospheres, pole-on magnetospheres, or oscilla- 
tions between these two states, as observed in Neptune’s magnetosphere. 

• Jupiter’s magnetosphere is the largest object in the solar system. Owing 
to strong centrifugal forces, the Jovian magnetosphere is rather flat with a 
plasma sheet in the equatorial plane. 

• Sources of magnetospheric plasma include the solar wind (except for 
Uranus), the atmospheres and ionospheres (except for Mercury), and at 
the outer planets also the moons and rings. 

• Radiation belts can be observed inside the Earth’s magnetosphere and the 
magnetospheres of the outer planets. At the orbits of the moons particle 
fluxes are signiflcantly lower because the moons absorb particles. Some 
moons create plasma tori along their orbits, such as lo in the Jupiter system 
and Titan orbiting Saturn. 



Exercises and Problems 

9 . 1 . Determine the energies of electrons and protons in resonance with the 
waves in the foreshock of the different planets. Use the numbers given in 
Fig. 9.11. 

9 . 2 . Confirm the calculated stand-off distances given in Table 9.3 (assume 
that the solar wind density decreases as r~^). 

9 . 3 . Discuss the relationship between the spin period and the magnetic mo- 
ment of the planet in terms of the magnetohydrodynamic dynamo. Would 
this relationship be in agreement with the assumption of a similar process 
working in all planets? 

9 . 4 . Discuss the possibility of aurorae and their shapes and detectability on 
other planets. Use your knowledge about the aurorae on Earth. 



9 . 5 . Describe and discuss the plasma sources in the different magnetospheres. 




10 Solar— Terrestrial Relationships 



It is a good morning exercise for a research scientist to 
discard a pet hypothesis every day before breakfast. It 

keeps him young. 
K. Lorenz, The So-Called Evif 



Solar-terrestrial relationships deal with the influence of the Sun and solar 
activity on our terrestrial environment. The driving force is the input of 
energy and matter into geospace. From the viewpoint of space plasma physics, 
the most important (and also the scientifically soundest) consequences have 
been discussed in Chap. 8. But there are also side-eflFects to these phenomena, 
such as the influence of solar energetic and auroral particles on the chemistry 
of the atmosphere, or the uproar caused in our technical environment due 
to severe geomagnetic disturbances. Other connections exist, too, relating 
solar cycle variations to weather and climate. These latter bear the seed for 
controversial discussions; nonetheless, some of the ideas will be reported here. 

Recent reviews on solar-terrestrial relationships are given in [41,67,172, 
498]. The Sun’s role in climate change is discussed in [238] and in the very con- 
troversial popular account [74]. Internet resources include www.estec.esa. 
nl/wmwww/wma/spweather/, www . windows . near . edu/spaceweather/basic_ 
facts.html, and www.sel.noaa.gov/index.html; additional links can be 
found at space . rice . edu/ISTP/. 



10.1 Solar-Terrestrial Relationships: Overview 

The Sun is a source of electromagnetic radiation, plasmas, fields, and ener- 
getic particles. These emissions have a constant component and a variable, 
solar-cycle-dependent one (see Fig. 10.1). Ordered by decreasing energy, they 
shape our terrestrial environment as follows: 

(a) The continuous solar electromagnetic radiation, described by the solar 
constant, determines the structure of the atmosphere and the climate. Since 
the solar constant varies by less than 0.1% during the solar cycle, its variation 

^ Reprinted from K. Lorenz On Aggression (transl. M. Latzken), Routledge, Lon- 
don, Copyright 1983, with kind permission from Deutscher Taschenbuch Verlag. 
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Fig. 10.1. Simplified overview of solar-terrestrial relationships. The top panel is 
from a poster for a 1978 workshop on solar-terrestrial infiuences on weather and 
climate at the Ohio State University 



is not recognized in climate. Although the solar constant is roughly constant, 
the hard electromagnetic emission is enhanced by up to an order of magnitude 
during solar maximum [310], causing solar cycle variations in the atmospheric 
height, the thermospheric temperature, and the ionospheric layers. 

(b) The solar wind and the frozen-in magnetic field shape interplanetary 
space and, on interaction with the terrestrial magnetic field, form the Earth’s 
magnetosphere. The energy density in the solar wind is six orders of mag- 
nitude smaller than the one in solar electromagnetic radiation. Nonetheless, 
disturbances in the solar wind cause much stronger modifications in the ter- 
restrial environment than does the flare’s increased electromagnetic radiation. 
In addition, the structure of the heliosphere leads to modulation of the galac- 
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tic cosmic radiation. These particles influence the chemistry and ionization 
of the atmosphere. 

(c) Solar flares emit electromagnetic radiation from the gamma to the 
radio range. As a consequence, the temperature, circulation, and chemistry 
of the upper atmosphere, and even the upper stratosphere, can be modified, 
leading to an increased drag on satellites in low orbits or to sudden iono- 
spheric disturbances. Since the travel time of light from the Sun to the Earth 
is only 8 min, these effects start immediately after the flare. 

(d) Coronal mass ejections and the shocks driven by them cause geomag- 
netic disturbances. Their influence is delayed by 1 to 3 days with respect to 
the flare. Both transfer energy to the magnetosphere which, during auroral 
activity, partly is transferred to the upper polar atmosphere. Energetic parti- 
cles accelerated at the shock can penetrate into the atmosphere at the polar 
cusps, causing additional ionization. This in turn influences the temperature, 
chemistry, and circulation of the thermosphere. 

(e) Solar energetic particles arrive at Earth a few hours after the flare. 
Like the particles accelerated at the shock, they can interact with the upper 
atmosphere in the cusp regions. 

In solar-terrestrial relationships, three “levels of knowledge” can be dis- 
tinguished: (i) accepted relationships with unambiguous observations and 
plausible explanations (for instance, the variations in the upper atmosphere 
and the disruptions to our technical environment), (ii) relationships with 
sound observations but with insufficient explanation (such as the long-term 
relationship between solar activity and climate, as evidenced, for instance, 
in the Little Ice Age), and (hi) relationships with less sound observational 
evidence and lack of sufficient explanations (such as the influence of the Sun 
on the drought cycle in the western US). In the latter cases, a relationship 
often appears unreasonable since the energy fed into the terrestrial system 
is small compared with the energy contained in the phenomenon. However, 
it should be noted that energetics is a tricky argument because the climate 
system is highly non-linear. Thus small energy inputs can act as triggers or 
might be amplified to cause strong consequences. 

In the remainder of this chapter, I shall give some examples of possible 
solar-terrestrial relationships. Since the energy argument is very strong, I 
shall start with a brief overview of the role of the solar electromagnetic ra- 
diation in our climate system, although strictly speaking this is beyond the 
scope of this book. But since all other effects have to be compared against 
variations in electromagnetic radiation, we cannot neglect such variations. 



10.2 Solar Activity, Climate, and Culture 

A controversial topic is the relationship between solar activity, weather, and 
climate. Recently, the discussion has become more aggressive since some au- 
thors claim that the temperature increase observed over the last hundred 
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years can be explained solely by variations in the solar output while other 
claim that it is solely anthropogenic. Since the atmosphere is a complex 
system, strongly coupled to the oceans, it is not likely that there is only 
one cause; rather climate variability most likely is influenced by many fac- 
tors which include internal oscillations of the climate system, extraterrestrial 
causes, as well as anthropogenic influences. To delve deeply into this debate 
would be the topic of another book; nonetheless, one observations relating 
climate and solar activity will be presented here as a starter, in particular, 
because it is often cited as reference. 

Although evidence is only indirect, a widely accepted relationship between 
solar activity and terrestrial climate exists for the last 7000 years. This cor- 
relation is closely tied to the name of Eddy [144, 145]; however, attempts to 
correlate the Sun and climate can be traced back for at least 200 years, a 
review is given in [41]. 

Figure 10.2 correlates solar activity and climate parameters. In the upper 
panel, the deviation of the content from its long-term trend is shown 
on a reversed scale, that is an increase in which indicates a decrease in 

solar activity, is pointing downwards. Since is a cosmogenic nuclide, it 
provides a measure of the intensity of the galactic cosmic radiation, which 
in turn is modulated by solar activity: high solar activity corresponds to a 
reduction in GCRs and therefore also in while during the solar minimum 
both are enhanced. Thus the record can be used as a long-term record 
of solar activity, which is shown in the middle panel. The panel does not 
show individual cycles because it is difficult to extract them from the 
record since the isotope is produced in the stratosphere but stored in organic 
matter.^ The combined transport and assimilation processes have time scales 
of some years, smearing out the individual solar cycles. The scale on is 
reversed because it should serve as a proxy for solar activity: a downward 
excursion from the average is meant as indicator for low solar activity. In the 
bottom panel, climate parameters are shown with T being the temperature, 
and W the winter index as an indirect measure of the strength of the winter, 
inferred from the time of freezing and thawing of certain lakes or rivers. The 
curve G gives the advance and retreat of glaciers in Europe. 

The flgure suggests that times of high solar activity are connected with a 
retreat in glaciers while lower solar activity corresponds to their advance. On 
shorter time scales of some hundreds to about a thousand years, the crude 
climate measure ‘glacier’ is supplemented by historical records and in the 
last 400 years also by direct measurements. Times of high solar activity not 
only are connected with higher temperatures but also with cultural growth. 
Examples are indicated by the vertical lines: (12) marks the Sumerian max- 
imum, the first advanced civilization, between about 2700 BC and 2600 BC, 
(11) the Pyramid Maximum (2300 BC to 2050 BC), (10) the Stonehenge Max- 

^ For solar cycle analysis the cosmogenic nuclide is more suitable because the 
time scales of its deposition are shorter. For a recent review see [37]. 
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Fig. 10.2. Changes in solar activity as inferred from cosmogenic nuclides and 
related changes in climate. Reprinted from J.A. Eddy [144], in Physics of solar 
planetary environments (ed. D.J. William), Copyright 1976, American Geophysical 
Union 



imum (1850 BC to 1750 Bc), (6) the Roman Maximum, and (4) the Medieval 
Maximum, when the climate was warm enough to allow the Vikings to cross 
the Atlantic Ocean and to discover Greenland as a green land. Times of low 
solar activity, such as the Sporer Minimum (2) and the Maunder Minimum 
(3), both marking the Little Ice Age, or (5) the Medieval Minimum often 
are related to cultural depression and retreat or to migration of nations in 
the hope of finding a more suitable environment. Evidence for higher solar 
activity during the Roman Maximum is also given by the frequently observed 
and reported aurora which today is rare in the Mediterranean. 

Although the observations are accepted, the mechanism (s) behind the 
correlation between weak solar activity and a colder climate is still not com- 
pletely understood. When the first measurements of the solar constant over an 
entire solar cycle became available in the late 1980s, they showed only a small 
variation of 0.08% in the solar constant with a maximum during solar maxi- 
mum. Thus despite the larger number of dark spots, the Sun emits more radi- 
ation at maximum conditions. This additional radiation comes from a bright 
chromospheric network related to the magnetic field structure. Nonetheless, 
for the climate system a variation in the solar constant by less than 0.1% is 
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way too small to produce any recognizable signal, even if this reduction lasts 
for some tens of years. 

Thus if we assume that the minima in solar activity indicated in Fig. 10.2 
are nothing more than a prolonged ordinary solar minimum, changes in solar 
irradiance would not explain the variations in climate. But long-lasting min- 
ima might be different from the ordinary sunspot minima known today. A 
careful analysis of observations of the Sun during the Maunder Minimum in 
the late seventeenth century suggested a larger solar diameter and a slower 
but stronger differential rotation than observed during the more recent solar 
cycles. In addition, although no sunspots were observed, the cosmogenic nu- 
clide ^®Be suggested the continuation of a solar cycle with a reduced period 
of about 9 years. ^ Thus the solar dynamo was still at work although at a 
faster pace and with weaker photospheric fields, as is evident from the lack 
of sunspots. Prom these changes in solar properties, reductions of the solar 
constant between 0.14% and 0.5% have been suggested. Presently it is still 
debated which of these numbers is more reliable and whether it alone would 
be sufficient to explain the Little Ice Age. However, there are also sugges- 
tions that, despite the lack of visible solar activity in the form of sunspots, 
the aurora as a consequence of solar activity still follows the 11-year solar 
cycle [467]. 



10.3 Solar Electromagnetic Radiation 

The Sun’s electromagnetic radiation is the ultimate energy input into the 
terrestrial system, shaping the climate system, the hydrological cycle, and 
also the biosphere. Thus variations in this energy input are expected to have 
consequences for the system of the Earth. Before we look at variations, first 
let us have a closer look at the role of the Sun in the climate system. 

10.3.1 The Climate System 

Climate is weather averaged over a long time period, or, as Pollack [415] 
puts it, “Climate is what you expect, weather is what you get.” The climate 
system basically resides in the troposphere; the upper atmospheric layers are 
believed to have only a small influence on climate. 

On time scales relevant for humans and societies, the Earth is believed 
to have a constant average surface temperature. Therefore, incoming and 
outgoing energy must be in an equilibrium. The incoming energy is described 
by the solar constant So = 1380 W/m^. This is intercepted over the area of a 
circle with a diameter equal to the radius of the Earth, which provides a power 

^ ^^Be records are stored in the Arctic and Antarctic ice sheets. In contrast to ^^C, 
the Be isotope is immediately washed out of the atmosphere by precipitation, 
allowing the detection of variations on time scales of about a few years. 
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Fig. 10.3. The effective temperature of 
the Earth is determined by the equilib- 
rium of incoming short-wave solar radia- 
tion and outgoing long- wave terrestrial ra- 
diation 



equal to Part of the radiation is reflected directly back into space. Its 

amount is described by the albedo A, which, averaged over the entire Earth, 
is about 30%. Thus the incoming power is given by Pin = (1 — ^) 5'o7rr|. The 
outgoing radiation can be characterized by an effective temperature Teff and 
a resulting radiation flux q — crT^^. This radiation is emitted over the entire 
Earth; see Fig. 10.3. Thus the outgoing power is Pout = 47rr|(jTg^ff. Since 
this equals the incoming radiation, we obtain the following for the effective 
temperature of the Earth 

Teff = yUEZS « 253 K . (10.1) 

This is well below the observed annual mean temperature of 288 K. The 
diflFerence results from the greenhouse effect: solar electromagnetic radiation 
is short-wave radiation which passes through the atmosphere nearly unhin- 
dered. The terrestrial radiation, on the other hand, is long-wave radiation 
well in the thermal infrared. This is absorbed mainly by water vapor and 
carbon dioxide in the atmosphere. Therefore, the atmosphere is heated and 
itself radiates. Part of this atmospheric radiation is again absorbed by the 
Earth. Thus the Earth receives the incoming solar short-wave radiation and 
recycled terrestrial long- wave radiation. Consequently, it has a higher tem- 
perature. Most of the above temperature difference of 35 K is due to absorp- 
tion by water vapor; the remaining difference is due to absorption by carbon 
dioxide and other, partly human-made, greenhouse gases. These gases are 
receiving a lot of attention in the discussion of human influences on climate, 
in particular the anthropogenic greenhouse effect. The interested reader is 
referred to [229] or to the detailed reports by the Intergovernmental Panel 
on Climate Change (IPCC) [62,230-233,550]. 

10.3.2 Short-Term Variability of the Solar Constant 

Obviously, the Earth’s effective temperature and thus also the Earth’s climate 
will change if any of the parameters in (10.1) changes, in particular, if the 
solar constant changes. 

Direct evidence for variation of the solar constant can be inferred from 
satellite instruments such as the Earth Radiation Budget Experiment (ERBE) 
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and the Active Cavity Radiometer Irradiance Monitor (ACRIM). Measure- 
ments are available since the late 1970s and reveal variations in the solar 
constant during the solar cycle by about 0.08%; for a recent review see [311]. 
The relative variation in the solar constant during the solar cycle is there- 
fore only small. Using (10.1), we can relate changes in the solar constant to 
changes in temperature by 



(5Teff ^ 1 SSq 
Teff 4 5o 



( 10 . 2 ) 



A 0.08% change in the solar constant then corresponds to a change in the 
effective temperature by 0.07 K over the solar cycle. This is much smaller than 
the year-to-year variability in temperature, and thus even if this influence 
were present it would probably be much too small to be detected; see the 
discussion in [238]. 

The picture changes if we look not at the solar constant, which is the 
integral over the solar emission spectrum, but at the spectrum itself. At the 
fringes of the black-body spectrum, in particular in the EUV and hard X- 
rays, the variability over the solar cycle is much larger, up to a factor of 10 
to 1000 (see Fig. 10.4). These frequency ranges, however, are absorbed in the 
mesosphere and thermosphere and are responsible for the large solar-cycle- 
related atmospheric variations observed above a height of about 80 km, in 
particular, variations in the thermospheric temperature, electron content in 
the ionosphere, and ionospheric heights. Even in the UV, which is respon- 
sible for both the production and the destruction of ozone, the solar-cycle 
variability is about 15%. Nonetheless, again the argument is that the energy 
content in this part of the Sun’s electromagnetic spectrum is way too small 
to produce any recognizable signal in the climate system. Before we start to 
speculate about effects related to these frequency ranges, let us first have a 
look at variations of the solar constant on longer time scales. 



10.3.3 Long-Term Variability of the Solar Constant 

Direct observations of the solar constant are limited to the last two solar 
cycles. The variability on longer time scales can be inferred by a number of 
different methods, for instance extrapolation from stellar activity; correla- 
tions between the Sun’s energy output and the solar radius and/or rotation 
rate, both of which have been observed since the Maunder Minimum; and ex- 
trapolation from the Call line, which is a measure of solar magnetic activity. 
Results from these methods are summarized in [41,118,496]. The various ap- 
proaches give a reduction of the solar constant during the Maunder Minimum 
of the order of 0.5% or a few watts. 

Although these variations are larger than the present solar-cycle variation 
of the solar constant, it appears doubtful whether they are sufficient to explain 
the Little Ice Age during the Maunder Minimum if the simple estimate of 
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Fig. 10.4. Solar spectrum (top) and spectral variability of the solar constant (bot- 
tom) derived from satellite measurements during solar cycle 21. Reprinted from 
J. Lean, Variations in the Sun’s radiative output [310], Copyright 1992, American 
Geophysical Union 

(10.1) is used. However, if inferred reduced solar constants are used in a 
coupled ocean-atmosphere global climate model, they give a small global 
cooling and a stronger cooling in the north Atlantic region, leading to a 
marked variation of the North Atlantic Oscillation (NOA). The resulting 
patterns, in particular those of storm tracks, are close to the weather and 
climate reports from the Maunder Minimum [155,442]. 

Nonetheless, this result is still a tentative one, because the chain of as- 
sumptions is long: the solar constant during the Maunder Minimum has to 
be guessed; the climate of the Little Ice Age has to be inferred from incom- 
plete data (in particular, data limited mainly to Europe); the climate model, 
although extremely complex, is still incomplete and so on. But the result is 
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important insofar as it suggests that variations in the Sun’s electromagnetic 
output have consequences for the terrestrial climate and that other influ- 
ences on climate, such as anthropogenic influences or space plasmas, have to 
be compared with this natural climate variability. 

10.3.4 At the Fringes of the Electromagnetic Spectrum 

The variability with the solar cycle is much larger at the fringes of the spec- 
trum, although the total energy flux is small. For instance, the variability 
around 100 nm can amount to about 15%; the resulting change in irradiance, 
however, is only about 0.1 W/m^. Nonetheless, the upper atmosphere reacts 
strongly to these variations: during the solar cycle, thermospheric tempera- 
tures vary by about 50-60 K at heights around 120 km and by some hundreds 
of K above 400 km, and the entire atmosphere expands during solar maxi- 
mum. Even a lower part of the atmosphere, the stratosphere, reacts to the 
solar cycle: the total ozone column varies by a few percent, with a maxi- 
mum around solar maximum [213,480] because the increased amount of hard 
electromagnetic radiation not only destroys but also creates ozone. 

Solar Electromagnetic Radiation Summarized: The variation in total 
solar irradiance (solar constant) is too small to lead to a detectable solar cycle 
signal in weather and/or climate parameters. Nonetheless, the Maunder Min- 
imum and Little Ice Age suggest that such a link might exist. The variations 
in solar irradiance are much stronger at the fringes of the spectrum and lead 
to detectable signals in the upper atmosphere, even down to the stratosphere. 
Since these harder wavelengths are absorbed well above the troposphere, they 
should not influence weather and climate. Nonetheless, with a suitable cou- 
pling mechanism between the upper atmosphere and the troposphere, the 
solar-cycle-induced variations at greater heights might have consequences for 
the troposphere, for instance because of wave coupling [481,482]. 



10.4 Energetic Particles and the Atmosphere 

Solar cycle variations in the upper atmosphere as well as its responses to so- 
lar activity are supported by many observations and can be understood quite 
well. The solar-cycle-dependent input into the upper atmosphere comprises 
hard electromagnetic radiation (its variation with the solar cycle as well as 
enhancement during flares), auroral particles, solar energetic particles, and 
galactic cosmic radiation. The consequences of these inputs are heating and 
ionization. The energy input follows different spatial patterns: changes in the 
solar electromagnetic radiation directly affect the entire dayside atmosphere 
or, since the exchange between dayside and nightside is fast and the atmo- 
sphere rotates with the planet, often the entire atmosphere. The galactic 
cosmic radiation basically influences the entire atmosphere, too. The energy 
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input of auroral and solar energetic particles, on the other hand, is limited 
to high latitudes. Consequences of variations in the electromagnetic radia- 
tion are, for instance, an increase in thermospheric temperature and density 
with increasing solar activity, changes in thermospheric composition, or an 
increase in ionospheric electron density with increasing solar activity. 



10.4.1 Precipitating Energetic Particles: 
Primary Consequences 



The primary consequence of the precipitation of energetic charged particles 
into the atmosphere is ionization. Some secondary effects resulting from this 
ionization are electron density increases in the ionosphere, modifications of 
the atmospheric chemistry, in particular the creation of NOa, and the destruc- 
tion of ozone, and variations in the global electric circuit. All effects are well 
documented insofar as they can be observed directly in individual events and 
do not require correlative studies. In addition, the physical mechanisms are 
understood reasonably well. 

The consequences of variations in the particle input into the atmosphere 
include ionization and heating, too. The particle energies range from a few 
kiloelectronvolts (auroral particles) to megaelectronvolts (solar energetic par- 
ticles) and even into the gigaelectronvolt range (galactic cosmic rays). Let us 
consider the direct energy transfer only, i.e. we do not follow the path of 
the particles but assume the particles to be incident isotropically from the 
upward-looking hemisphere into the upper atmosphere. The primary energy 
transfer mechanism is ionization, as described by the Bethe-Bloch equation: 

r. 2m,v^ ln(l-/ 32 )_/ 32 l 
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The energy loss dE per distance travelled dx thus depends on some funda- 
mental constants (the first fraction) , the parameters of the incoming particle 
(the second fraction, where Z is the charge of the particle and v its velocity, 
and P = v/c), and the parameters of the material (the remaining part of the 
equation, where Uq is the electron density and (Eb) the average ionization 
energy). The energy loss of an ionizing particle in a given material (that is. 
He = const) therefore increases with the particle’s charge and decreases as 
The terms in the square brackets are corrections which become impor- 
tant at relativistic energies: the first term depends on the ratio of the particle 
energy to the average ionization energy, and the other terms are the relativis- 
tic corrections depending on the ratio (3 between the particle’s speed and the 
speed of light. For 1, the energy loss is constant and minimal: the par- 
ticle is minimally ionizing, with the energy loss determined by the particle’s 
charge (and the parameters of the absorbing material). For non-relativistic 
particles, the energy loss is 



dE Z 
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Fig. 10.5. Energy loss of 
a proton in the atmosphere 
for various energies between 



1 MeV and 1000 MeV 



Equation (10.3) gives the specific energy loss of the particle and therefore 
can be used to track the particle through the atmosphere. The equation 
has to be solved numerically. Figure 10.5 shows the energy loss for protons 
with various energies between 1 MeV and 1 GeV in the atmosphere. At a 
given height, dE/dx decreases with increasing particle energy because of the 
v~‘^ dependence of the energy loss. For the same reason, the energy loss 
increases towards the end of the particle range: most of the particle’s energy 
is deposited close to its stopping height in the atmosphere. Since the energy 
is lost because of ionization of the atmosphere, the particle’s energy loss is 
directly proportional to the number of ion pairs produced along its track, 
with the proportionality coefficient being the average energy required for one 
ionization. 

During an SEP event, a large number of particles precipitates into the 
atmosphere. Each individual particle causes ionization as described by (10.3) 
and Fig. 10.5. For the atmospheric consequences, we are interested in either 
the total ionization (that is, the ionization of all particles is summed over the 
entire event) or the ion pair production rate, that is, the number of ion pairs 
produced per unit time interval. This latter quantity is required for chemistry 
models. It can be obtained from energetic-particle measurements, such as 
those shown for the Bastille Day event in Fig. 7.26, by folding the observed 
particle intensities with their respective energy losses [426]. Figure 10.6 shows 
the ion pair production rates for three days of the Bastille Day event. Curve 
1 includes the onset of the event, where the highest particle energies are 
dominant. Consequently, ion pair production occurs at rather low altitudes. 
With increasing time, the intensities of high-energy protons decrease while 
the intensity at lower energies increases, shifting the ion pair production 
to greater heights and to a larger maximum. In particular, the bulk of ion 
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Fig. 10.6. Ion pair produc- 
tion rates (daily averages) 
during the Bastille Day event 
(see Fig. 7.26) 



pair production is at heights between 30 km and 90 km, where normally the 
atmosphere is neutral: thus energetic charged particles provide a source of 
ionization at heights where normally no ions exist. Therefore they can have 
a rather strong influence on atmospheric chemistry. 

Excursion 8. For a simple estimate, we can circumvent the numerical in- 
tegration of the Bethe-Bloch equation with the following ansatz. As above, 
the ionization rate is the product of the flux of precipitating protons and 
dE/dz, their energy loss, giving an ionization rate 

d F 

q,{z) = Fe— . (10.5) 

The energy loss can be approximated as 

^ = KeEionCTnn • ( 10 . 6 ) 

This quantity depends on the collisional ionization rate Kq, the energy E’ion of 
the precipitating ions, the interaction cross section an and the number density 
n. Inserting (10.6) into (10.5), taking into consideration the barometric height 
formula (8.19), yields 

^e('^) ~ ^e-FeF/iQji<:^n^O OXp( Z j . (10.7) 

From (10.6) we also obtain 

E Xs 

Eo = = j KeWion(Tnnoexp{~z/H) . (10.8) 

0 oo 

Solving for Zg gives the stopping height: 

Zs^Hln(^K^annoH^^ . (10.9) 
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10.4.2 Precipitating Particles and Ozone 

The consequence of ionization by precipitating particles is a modification of 
the atmospheric chemistry. The bulk of the ions produced by solar energetic 
protons or magnetospheric electrons in the middle atmosphere, i.e. at alti- 
tudes from a few tens of kilometers up to about 90 km, are Nj and Oj , simply 
because N2 and O2 are the major species; smaller amounts of Ar“^, CO^, O3 , 
O"^, and are produced too. The electrons released in these interactions 
either are lost by recombination with a positive ion or form negative ions 
with a previously neutral atmospheric constituent. 

A Little Ozone Chemistry. The most important chemical reactions in the 
middle atmosphere involve the ozone layer. Some of the ions created by the 
incident protons are ozone-destroying radicals. The reduction of the ozone 
layer locally can be as strong as a factor of 2 to 4 , as has been observed in 
the strong polar cap absorption (PCA) event following a flare in November 
1969 [ 555 ]. Both and Nj contribute to ozone destruction. 

The destruction of ozone by Oj is based on a chain of events. First, the 
ionized oxygen molecule has to attach itself to a water molecule. The latter 
then reacts with another water molecule: 

0 + • H2O + H2O H3O+ • OH + O2 , 

forming a hydroxyl radical OH and an H"^. The HO^; species are predom- 
inately formed in the upper stratosphere and the mesosphere. They are a 
rather short-lived component which is destroyed by photochemical processes 
within less than a day. 

The following pairs of reactions can result: 

H + O3 OH + O2 , 

OH + O H + O2 , 

OH O3 HO2 + O2 , 

HO2 H” O OH + O2 , 

OH + O3 HO2 + O2 , 

HO2 + O3 ^ OH + 2O2 . 

In the upper two chains, the net effect is the recombination of an ozone 
molecule and an oxygen atom, yielding two oxygen molecules: 

0 + 03^202. 

In the lower chain, the net effect is a recombination of two ozone molecules, 
yielding three oxygen molecules: 

O3 + O3 — > 3O2 • 

In these reactions, H and OH act as catalysts. They are lost from the reaction 
chain only when two of them meet, forming either H2 or H2O. These reactions 
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Fig. 10.7. Calculated 
NO-production in the at- 
mosphere for three solar 
energetic-particle events 
and for the galactic cos- 
mic radiation under solar 
minimum and maximum 
conditions. Reprinted from 
G.C. Reid [439], in Physics 
of the Sun, voL III (eds. P.A. 
Sturrock, T.E. Holzer, D.M. 
Mihalas, and R.K. Ulrich), 
Copyright 1986, with kind 
permission from Kluwer 
Academic Publishers 



have time constants of a few hours, and thus the reaction chain cannot go on 
indefinitely and the ozone will recover after several days. In addition, these 
reactions reduce the water content in the middle atmosphere, since during 
recombination more H2 is formed than H2O. Thus further ozone reduction, 
for instance during the next PCA, will be reduced. This is a kind of self- 
healing of the atmosphere. 

Since they are long-lived, stable ions function as catalysts only; the main 
hazard for the ozone layer is the nitrogen oxides, in particular NO. These 
NOx species are formed in both the stratosphere and the mesosphere. For 
instance, NO is produced by a two-step reaction from Nj. First, Nj under- 
goes dissociative recombination, giving two neutral nitrogen atoms. If these 
interact with oxygen, then NO is formed: 

N + O2 -> NO + O . 

Ozone destruction is then caused by the reactions 

NO -p O3 — > NO2 “h O2 , 

NO2 -p O — > NO + O2 . 

One or two large PCA events, lasting for 2 or 3 days, produce more 
nitrogen oxides than the galactic cosmic radiation produces during an entire 
year; see Fig. 10 . 7 . Thus at times of high solar activity, the NO production in 
the mesosphere is determined basically by solar activity. The nitrogen oxides, 
however, are produced at higher altitudes than those produced because of 
ionization by galactic cosmic rays since the solar energetic particles have 
lower energies. But the removal of the ozone-destroying NO is slower at higher 
altitudes: the loss mechanism for NO above about 40 km is photodissociation. 
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leading to the formation of N 2 . NO produced by the galactic cosmic radiation 
close to the tropopause, on the other hand, can be washed out rather easily, 
forming nitric acids. 

Observational Evidence: The Bastille Day Event. Evidence for a de- 
pletion of the ozone layer was first reported for the large SEP of August 
1972 [117,219]. A prominent recent example is the ozone depletion in the 
Bastille Day event of 14 July 2000. The proton intensity profiles have al- 
ready been shown in Fig. 7.26, and the ion pair production rates are given in 
Fig. 10.6. Ozone observations by the HALOE instrument on board the UARS 
satellite show a marked reduction in ozone concentration inside the polar caps 
during the particle event; see Fig. 10.8. Ozone concentrations above 0.5 hPa 
(about 50 km) are reduced by 40% for about 2 days, mainly owing to the 
short-lived HO^c. Ozone depletion at lower heights is smaller but lasts longer 
because the long-lived NO^^ destroys ozone. 

Numerical models of atmospheric chemistry and circulation combined 
with ionization models have been developed well enough to model the basic 
observed features of particle-induced ozone depletion, namely the spatial and 
temporal patterns. The ionization can extend to geomagnetic latitudes below 
the polar cap: large SEP events are accompanied by travelling interplanetary 
shocks and CMEs. As these hit the Earth, geomagnetic disturbances lead to 
a rearrangement of the field and allow particles to have access to normally 
closed dipole field lines. This reduction in the geomagnetic cutoff latitude has 
been observed by SAMPEX [321]. 

Although this effect is well documented, we should be aware that ozone 
depletion due to energetic particles is limited to higher latitudes, namely 
inside the polar cap, and to heights above about 35 km. Thus SEP events 
affect the atmosphere but only in a limited spatial region. 



Fig. 10.8. Ozone concen- 
tration above 0.5 hPa in 
the northern hemisphere prior 
to (left) and during (right) 
the Bastille Day event. The 
white ring marks the polar 
cap. Reprinted from C. Jack- 
man et al. [252], Geophys. 
Res. Lett 28, Copyright 2001, 
American Geophysical Union 
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Precipitating Particles During the Solar Cycle. SEP events typically 
last for a few days and are more frequent during solar maximum than during 
solar minimum. Since the NO^ produced by the precipitating particles is long- 
lived [251,536,537], the NO^; production of subsequent events is superposed, 
leading to stronger atmospheric effects. 

Figure 10.9 shows the changes in the total ozone column due to precip- 
itating particles calculated from the observed proton fluxes at GOES using 
the SLIMC AT/TOMCAT chemistry model [99] for the time period 1988 to 
2002. The solar cycle is obvious with maxima in ozone depletion around the 
solar maxima in 1990 and 2000. Although the particle precipitation is limited 
to latitudes well above 60°, ozone depletion is also visible at lower latitudes, 
in particular during times of high solar activity. This spatial spread of ozone 
depletion reflects the atmospheric transport: NOa; is produced in the polar 
caps at high latitudes but can be transported to lower latitudes. At mid- 
latitudes, the solar cycle variation in ozone due to precipitating energetic 
particles is comparable to that due to the variation in hard electromagnetic 
radiation, although the two effects have opposite signs. The differences be- 
tween the northern and the southern hemisphere result from the concurrent 
photochemical processes. The first strong signal of ozone depletion in Fig. 10.9 
is due to the October 1989 event; it is much more pronounced in the north- 
ern hemisphere because it occurs just at the beginning of the polar night 
and thus no photochemical reactions can destroy NO^;. In the southern hemi- 
sphere, on the other hand, ozone depletion is much smaller because here NO^; 
is destroyed by photochemical processes. 

Since ozone also has radiative properties, precipitating energetic particles 
influence not only the atmospheric chemistry but also the radiative trans- 
port. As a consequence, temperatures in the stratosphere are modified [489]. 
Since circulation patterns are modified when the temperature gradients are 
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Fig. 10.9. Variation of total ozone column calculated with the SLIM- 
CAT/TOMCAT model using proton intensities measured by GOES. Figure cour- 
tesy of M. Sinnhuber, University of Bremen; Copyright M. Sinnhuber 
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Fig. 10.10. As Fig. 10.9, but for a vanishing geomagnetic field. Figure courtesy of 
M. Sinnhuber, University of Bremen; Copyright M. Sinnhuber 



changed, precipitating particles also have the potential to influence climate. 
An anapliflcation of this process - or a damping - might also result from 
planetary waves (gravity waves) [481,482]. 

This effect might become even more pronounced on longer time scales for 
two reasons. First, the present-day Sun appears to be unusually quiet with 
respect to particle event production, as can be inferred from thin nitrate layers 
in ice cores [347, 348]- the very same nitrate that we have discussed above in 
connection with ozone chemistry. In particular, Carrington’s white light flare 
stands out in the record, suggesting a total particle fluence about eight times 
that in the largest event (October 1989) in spacecraft records. In addition, 
in the 1890s, a series of four events with particle fluences exceeding that of 
the October 1989 event is observed within a few years. Simulations suggest 
patterns of ozone destruction similar to those discussed so far; however, the 
magnitude of depletion is much larger. In particular, in the 1890s series of 
events, NO^ from the individual events is summed. 

Secondly, although it is decreasing, the geomagnetic held is still strong and 
efficiently reduces the fluxes of particles precipitating into the atmosphere. 
If the held were to decrease to about 25% of its present-day value, as typi- 
cally assumed for a held in reversal, particles would precipitate over a much 
larger area. Model simulations give a rather amazing result (see Fig. 10.10): 
although the particles now precipitate over the entire globe, ozone depletion 
at low latitudes is still rather weak, while it is strongly enhanced at high lat- 
itudes despite the fact that the particle fluxes have not changed there. This 
counter-intuitive result points to the problems in making simple estimates 
in a complex system and can be understood from the superposition of NO^; 
generation, transport, and depletion patterns. 

Precipitating Magnetospheric Electrons. SEP events are not the only 
particle events that influence atmospheric chemistry and ozone. During strong 
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geomagnetic storms, particles from the radiation belts are injected into the 
loss cone and precipitate into the atmosphere too. The electron component 
of the magnetospheric particles is energetic enough to precipitate down to 
altitudes of about 60 km. As in the case of the SEPs, these electrons ionize 
the atmosphere, which leads to NO^; production and ozone depletion [75-81]. 
It should be noted that the spatial precipitation pattern is different: while the 
solar energetic particles precipitate inside the polar cap, the magnetospheric 
particles precipitate inside the auroral oval. 

10.4.3 Precipitating Particles and Thermospheric Circulation 

We will now have a look at rather low energetic particles that stop in the 
thermosphere or upper mesosphere. The main consequence of these particles 
is an increase in ionospheric electron density and heating. 

The latitudinal temperature gradient in the thermosphere drives a merid- 
ional circulation system. This gradient is caused by the latitudinal varia- 
tion of both the solar electromagnetic radiation and energetic particles. As 
in the troposphere, more radiative energy is absorbed close to the equator 
than at high latitudes. In the thermosphere the energetic particles incident 
at high latitudes provide an additional heat source. Thus the thermospheric 
circulation system shows a seasonal dependence (latitude dependence of the 
electromagnetic radiation) as well as a dependence on the solar cycle. 

Figure 10.11 shows model calculations for the thermospheric circulation 
for the equinoxes (left) and solstices (right) for different levels of solar activ- 
ity: (a) an extremely quiet Sun, (b) average solar activity, and (c) a highly 
active Sun. The influence of the latitudinal dependence of the incident elec- 
tromagnetic radiation can be seen best for very quiet solar conditions: the 
thermospheric circulation then consists of a thermally driven Hadley cell. 
During the equinoxes, heating is strongest close to the equator, leading to 
an updwelling of hot air which, at higher altitudes, is transported poleward 
where it cools, sinks, and moves as cold air equatorwards through the strato- 
sphere (that would be at the bottom of the figure), closing the circulation 
cell. During solstices, only one Hadley cell evolves, spanning the entire globe. 
Heating is strongest at high latitudes in the summer hemisphere, thus air 
dwells up there and is transported across the equator towards the high lati- 
tudes of the winter hemisphere where it sinks. 

With increasing solar activity, heat is supplied to the high-latitude upper 
thermosphere by energetic particles. Thus close to the poles, secondary circu- 
lation cells evolve providing heat transport towards lower latitudes. During 
equinoxes, two such cells are observed, one over each pole. With increasing 
solar activity, these cells expand, nearly suppressing the thermally driven cir- 
culation. During the solstices only one such cell develops in the winter hemi- 
sphere. Wind speeds in the thermospheric circulation can be up to 2500 km/h. 

Figure 10.11 is a grossly simplified meridional cross-section. Since the 
Earth rotates, the real thermospheric circulation consists of a thermally 
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Fig. 10.11. Thermospheric circulation at the equinoxes and the solstices for ex- 
tremely quiet solar conditions (a), average solar activity (b), and a very active Sun 
(c). Reprinted from R.G. Roble [444], The upper atmosphere and magnetosphere^ 
Copyright 1977, with kind permission from National Academy Press 



driven meridional component and a zonal deflection due to the Coriolis force. 
Whereas in the troposphere the Coriolis force breaks up the Headley cell 
into three cells in each hemisphere, the thermospheric circulation pattern 
basically consists of the one cell shown in Fig. 10.11. Only around the geo- 
magnetic poles can closed zonal circulation patterns exist, depending on solar 
activity. In the southern hemisphere, closed vortices around the pole appear 
to be a dominant feature which is observed in the oceans, in the stratosphere 
(where it plays a prominent role in the seasonal appearance of the ozone 
hole), as well as in the thermosphere. 

10.4.4 Precipitating Particles and the Global Electric Circuit 

Precipitating energetic particles ionize the atmosphere. If the ionization is 
strong enough, even the electric properties of the atmosphere might be mod- 
ified. However, we normally think of the atmosphere in terms of a neutral 
medium and atmospheric electricity as a phenomenon limited to thunder- 
storms. But the galactic cosmic rays can penetrate well down into the tropo- 
sphere; in fact, they are the main source of ionization in the troposphere. Even 
directly above the ground, radioactive decay contributes only about half of 
the ionization. It has been suggested that energetic electrons can account for 
fair-weather lightning because with a sustained electron avalanche, they can 
provide the necessary charge transfer even at electric-fleld magnitudes much 
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smaller than needed for the initial breakdown [206,207]. In a well-developed 
thunderstorm, cosmic rays may introduce erratic variations in flash rate and 
type [341], 

Thunderstorms and ionization by energetic particles have to be viewed in 
the broader context of the global electric circuit between the Earth’s surface 
and the ionosphere. Between the ionosphere and the ground, there is a voltage 
drop of about 250 kV. Discharge through the atmosphere would destroy this 
potential drop within minutes. The most important generator in this circuit 
is the totality of global thunderstorms; smaller generators are tidal winds 
within the ionospheric dynamo region and the interaction of the solar wind 
with the magnetosphere. 

Some kind of continuation of thunderstorms from the troposphere up 
to the ionosphere is observed in the form of sprites and elves, which are 
brief flashes of light above large thunderstorms [562]. The sprite has been 
identifled as an electrical discharge; they tend to have a vertical extension 
and are some kind of electrical (not optical) mirror image of cloud-to-ground 
lightning. Elves are horizontally extended with a doughnut shape: the vertical 
lightning return stroke creates an electric fleld azimuthally symmetric around 
the axis of the lightning channel. 

The solar modiflcations to the global electric circuit are due to ioniz- 
ing radiation, that is, energetic particles and hard electromagnetic radiation. 
Both vary with the solar cycle: the ionizing radiation from galactic cosmic 
rays, which affects mostly the troposphere and lower stratosphere, is contin- 
uous and enhanced during solar maximum, while the ionization from SEPs 
and magnetospheric particles is sporadic. Ionization by SEPs occurs pre- 
dominately in the stratosphere and mesosphere and is more frequent during 
solar maximum, while ionization by magnetsopheric particles occurs in the 
mesosphere and above. Therefore, the global electric circuit is a system that 
depends on the solar cycle and couples different layers of the atmosphere, 
from the troposphere to the upper atmosphere [518,540]. 

The coupling between atmospheric electricity and climate is most likely 
due to cloud formation, the redistribution of charges within clouds, and the 
loss of charges by rain. Such a mechanism is attractive from the viewpoint of 
energy. As mentioned in the introduction to this chapter, the energy density 
in the space plasmas is much lower than that contained in the electromagnetic 
radiation. However, with the aid of the global electric circuit, the influence 
occurs not via the input of additional energy but via a redistribution of the 
available energy. Observational support also exists (see below). 

The variation of the ionization by space plasmas would influence both 
parts of the global electric circuit: the generator, thunderstorms, and the 
load, the atmosphere. This in turn modifies droplet formation and the de- 
velopment of clouds. In particular, an increased electric fleld leads to larger 
droplets and heavier precipitation in fair-weather areas. If heavier than usual 
precipitation has occurred, the condensation of water vapor must have been 
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stronger and, consequently, more latent heat has been released during cloud 
formation. Thus the higher troposphere has become warmer, which in turn 
affects mesoscale and synoptic circulation patterns - a possible influence on 
weather and climate. 

10.4.5 Solar Cycle Length, Galactic Cosmic Rays, 
and Cloud Cover 

The ionization-cloud-formation-climate change chain also underlies a hotly 
debated scenario linking solar activity to climate variations. 

For the time period 1870 to 1990, Friis- Christensen and Lassen [171] re- 
port a correlation (with a 95% confidence level) between the global tempera- 
ture trend and the length of the solar cycle: with decreasing cycle length, the 
surface temperature of the Earth increases. The two curves track each other 
quite well; in particular, the decrease in temperature between 1940 and 1970 
is correlated with an increase in the solar cycle length. 

A suggestion for the physical link has originated from an entirely differ- 
ent correlation, between the global cloud cover and the intensity of galactic 
cosmic rays [334,509,510]: with decreasing intensity of the galactic cosmic 
radiation, the cloudiness decreases too. Thus, it appears that the atmosphere 
acts as a giant cloud chamber. Subsequently, it has been shown that the 
galactic cosmic ray intensity also shows a long-term trend which is closely 
correlated with the solar cycle length [510]. Thus, it appears that the global 
temperature is influenced by the intensity of the galactic cosmic radiation, 
which in turn is determined by the state of the heliosphere. The causal link 
between galactic cosmic rays and temperature might be directly due to the 
cloud cover, as suggested in [509], or, in a more complex process, it might be 
related to global atmospheric electricity [517]. 

However, the picture might not be as simple as sketched here. For instance, 
some doubt has been raised about the original correlation between solar cycle 
length and temperature, which is basically concerned with the determination 
of the solar cycle length [372]. In addition, ground-based observations of cloud 
cover [301] do not seem to confirm the correlation reported in [334]. 

A more serious problem, however, exists not in the data analysis but in the 
causal chain: the correlation with clouds is best with low clouds (below 3 km) 
at low latitudes. At low latitudes, however, there is almost no modulation of 
the galactic cosmic radiation with the solar cycle; see Fig. 8.52. This does not 
disprove a possible causal relation between clouds and cosmic rays; however, if 
it were to exist, it would not be a simple, straightforward correlation. Instead, 
it must invoke the complexity of the climate system and its coupling with 
the global electric circuit - with the latter not being fully understood. 
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10.5 Sector Boundaries, Droughts, and Thunderstorms: 
Sun and Weather 

Attempts at correlations between weather or climate and the solar cycle or 
the geomagnetic activity, such as the example discussed above, are many; a 
comprehensive, though somewhat older overview can be found in [344]. Some 
of these correlations, such as the droughts in the western US, can be traced 
back for centuries, while others cover only shorter time spans of 3 to 10 solar 
cycles because older records are not available. 

In contrast to the particle influences discussed in Sect. 10.4.2 the data 
base is limited to correlations while the direct observation of the relevant 
details in the causal change and an understanding of this chain are missing. 

10.5.1 Droughts in the Western US 

A prominent example of the correlation of weather phenomena and the solar 
cycle are droughts in the western US, in the dust bowl. Droughts severely 
affect plant growth, and thus a drought index can be derived from tree rings: a 
wide tree ring indicates plenty of water available while a smaller one suggests 
a reduction in the water supply. To reduce the influence of temperature on 
tree ring size, plant species with different dependences on water supply and 
temperature can be compared. With these data in hand, a drought index 
for the western US can be traced back to about 1600, the time when the 
first systematic records of sunspots and solar activity started. There is a 
correlation between droughts and solar activity at the 99% confidence level; 
however, this correlation is not with the 11-year sunspot cycle but with the 
22-year magnetic cycle. The largest extension of the area affected by drought 
is about 2 to 3 years after the minimum at which the Hale cycle starts. The 
severity of the droughts is also modulated with a 90-year cycle, which also 
modulates the sunspot cycle. This is the GleiBberg cycle with is also evident 
in the nitrate layers in ice cores. 

The dependence on the 22-year cycle is amazing in so far as it suggests a 
relationship between climate and the magnetic properties of the Sun rather 
than with its radiation. Although the underlying processes are not completely 
understood, it appears likely that the energy that triggers the atmospheric 
change, i.e. the drought, is fed into the terrestrial system by geomagnetic 
activity and/or particles. During times of low solar activity, recurrent geo- 
magnetic disturbances can be observed (see Fig. 8.37). A comparison of held 
variations in subsequent solar rotations allows the definition of a geomagnetic 
recurrence index. The latter one is large only if the geomagnetic disturbance 
can be observed again during the next rotation. Thus the recurrence index is 
anti-correlated with the solar cycle since at times of high solar activity tran- 
sient disturbances modify the structure of interplanetary space and recurrent 
fast solar wind streams are observed less frequently. The recurrence index is 
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larger and stays high for a longer time period in the minimum between an 
even and an odd solar cycle, i.e. at the beginning of a new magnetic cycle 
when the droughts in the western US are also observed. 

10.5.2 Sector Boundaries and Weather Forecast 

Evidence for a possible involvement of the interplanetary magnetic field also 
stems from a correlation between the vorticity area index (VAI) and geomag- 
netic activity. The VAI describes the size of the vortex field on the 500 mb 
surface and can be used as a measure of the strength of the high- and low- 
pressure regions. Thus the VAI is directly related to the large-scale atmo- 
spheric phenomena that determine our weather. On the basis of a data set 
of some hundred sector boundary crossings collected over 20 years, a pattern 
is established with a reduction of the VAI by up to 10% on the day after 
the sector boundary crossing. The reduction is strongest if the interplane- 
tary magnetic field has a southward component (open magnetosphere); it is 
weaker if the interplanetary field is directed northwards. The strongest reduc- 
tions in VAI are observed if the interplanetary magnetic field has a southward 
component and the sector boundary is accompanied by a stream of fast pro- 
tons. Thus somehow energy fed from the interplanetary medium through the 
magnetosphere into the upper atmosphere modifies the fiow pattern in the 
troposphere. 

Modifications to the VAI directly influence the weather. This can be shown 
indirectly: the reliability of the 12 h and 24 h weather forecast in the middle 
latitudes of the northern hemisphere is reduced from 85% to 65% on the day 
following the sector boundary crossing. In addition, the thunderstorm area 
then is largest, too. 

Both correlations give some clues to the possible mechanism. Since the 
tropospheric response is strongest for a southward interplanetary magnetic 
field accompanied by a stream of energetic protons, particles fed into the 
upper polar atmosphere appear to trigger the tropospheric changes, proba- 
bly by modifying the stratospheric heat balance and circulation which then 
modifies the tropospheric circulation pattern. The thunderstorm area s larger 
because ionization caused by the precipitating particles modifies the global 
electric circuit which is directly related to thunderstorm activity. 

10.5.3 Solar Cycle Signals in Tropospheric Winds 

There are also suggestions for changes in the tropospheric wind patterns 
with the solar cycle. In the northern hemisphere, winter storm tracks are 
2.5° farther south during solar maximum than during solar minimum. This 
is related to an equatorward motion of the jet stream, a fast zonal wind in 
the upper troposphere which guides the motion of the pressure systems. For 
an observer at northern mid-latitudes the weather depends on whether she 
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Fig. 10.12. Upper tropospheric circulation pattern for very low (left), average 
(middle)^ and high geomagnetic activity {right). Reprinted from V. Bucha [66], 
Ann. Geophys. 6 , Copyright 1980, Springer- Verlag 



is north or south of the jet stream: south of the jet stream, she is met by 
the low-pressure regions and accompanying storms, giving a mild and rainy 
winter, while north of the jet stream the arctic high with clear and chilly 
skies dominates the weather. 

Not only the average latitudes of the jet stream and the storm tracks shift 
but also their northward and southward excursions. Figure 10.12 shows the 
circulation pattern over the northern Atlantic ocean for extremely low (left), 
average (middle), and high (right) geomagnetic activity. The times of high 
geomagnetic activity are associated with strong westerlies over the Atlantic 
ocean, thus storms move from west to east, leading to mild and rainy weather 
in mid-Europe. With decreasing solar activity (middle), a blocking weather 
pattern develops over the Atlantic Ocean, leading to the penetration of Arctic 
air into mid-Europe, as indicated by the arrow in the middle panel. If the 
geomagnetic activity is very low, the blocking pattern becomes even stronger 
with westerlies only at relatively low latitudes and the intrusion of Arctic air 
well into mid-latitudes. 

The changes in the tropospheric wind pattern are most likely a super- 
imposition of two effects: the increased ionization and heating of the upper 
atmosphere by hard electromagnetic radiation during solar maximum as well 
as the increased precipitation of particles into the atmosphere at high lati- 
tudes. The latter ones obviously change the thermospheric/stratospheric cir- 
culation but do not have enough energy to change the tropospheric pattern. 
Nonetheless, it appears possible that atmospheric waves might provide a suf- 
ficient coupling between the different atmospheric layers to trigger such a 
straightening of the tropospheric flow pattern. 

There is another interesting aspect related to this interpretation. The 
change in the flow pattern might require only a relatively small amount of 
energy. In fact, changes between westerly and blocking patterns are frequent 
and natural, it is only the persistence and higher frequency of the blocking 
pattern at times of low solar activity that cause recognizable tropospheric 
effects. The consequences of this changed pattern, however, might evoke the 
impression of much larger energy inputs. In mid-Europe, a larger number of 
blocking weather patterns, which were observed, for instance, also during the 
Little Ice Age, always mean cold and relatively dry winters. Thus the average 
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Fig. 10.13. Westerly flow 
and blocking weather pat- 
tern over North America. 
Reprinted from Roberts and 
Lansford [443], The climate 
mandate^ Copyright 1979, 
with kind permission from 
Freeman Co. 



temperature in mid-Europe would be reduced. But this does not necessarily 
imply a global temperature reduction. Instead, it is more a redistribution 
of heat; thus while in one place the temperatures are lower in another they 
might be higher. Figure 10.13 illustrates this. It shows the main direction of 
the upper tropospheric motion for a westerly flow and a blocking weather 
pattern over North America. The westerly flow brings mild and wet winters 
all over the US while the blocking pattern leads to a far northward excur- 
sion of warm air, leading to unusually high temperatures in Alaska. This air 
cools over Alaska and moves south towards Florida, leading to unusually cold 
eastern winters with a lot of snow. As a consequence, during one such block- 
ing weather pattern in winter 1977, on some days in January temperatures 
were higher in parts of Alaska than in Florida. The western US, on the other 
hand, experiences a period of drought since the moist air coming from the 
Pacific Ocean has moved northwards. Thus even the dependence of the west- 
ern US droughts on the solar cycle can be fitted, at least, qualitatively into 
this picture. 



10.6 The Technical Environment and Solar Activity: 
Geomagnetic Storms 

Solar-cycle-related disturbances in our technical environment mainly stem 
from magnetic storms caused by the variable solar wind. The changing mag- 
netic flux at the Earth’s surface causes induction currents in long leads, in 
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particular in power lines or pipelines. Since these currents can amount to 50- 
100 A, they can trigger overload protectors or cause damage to transformers, 
leading to a disruption of the electrical power supply. In recent years, this 
problem has become more important since power supplies are connected over 
increasingly larger distances. Since a large contribution to the disturbed geo- 
magnetic field is due to the enhanced auroral electrojet, high latitudes, such 
as Scandinavia, Canada, or the northern US, are more frequently affected by 
these events. For instance, the large geomagnetic storm on 13 March 1989, 
one of the largest ever recorded, caused a disruption of the entire 9000 MW 
power supply network of the province of Quebec for more than 9 hours. Com- 
puter networks also are affected or damaged by induction currents. 

Pipelines for natural gas or oil are vulnerable to large geomagnetic storms 
because the induction currents cause corrosion. This effect has been observed 
best at the Alaska oil pipeline, which stretches for about 1300 km from north 
to south through the auroral oval. Where the pipeline is connected to the 
ground, the induction currents lead to strong corrosion. 

The effects of geomagnetic disturbances are not only limited to the Earth’s 
surface. Satellites also can be damaged by induction currents in their elec- 
tronics or on sensitive surfaces, leading to the failure of the satellite or at least 
of some instruments. Occasionally, a satellite might lose its spatial orienta- 
tion in a large geomagnetic storm: some satellites are oriented by reference to 
the geomagnetic field. If the latter is strongly disturbed, in particular if fluc- 
tuations are fast or if the magnetopause has moved so far inwards that the 
satellite suddenly finds itself in interplanetary space where the orientation 
of the magnetic field is entirely different, communication with the satellite 
might be lost because the satellite is not able to keep the proper orientation 
of its antenna. Although important at the time of the geomagnetic storm, the 
disorientation does not cause permanent damage since the satellite is able to 
orientate itself properly as soon as average magnetic conditions are restored. 

Higher magnetospheric electron fluxes during magnetic storms also might 
cause satellite failure if the spacecraft becomes electrically charged. This, 
by itself, does not cause harm, but becomes harmful if discharge occurs be- 
tween different components. In addition, satellites are also vulnerable to the 
direct action of energetic particles: high energetic particles, although only 
few in number, might cause radiation damage to the electronics. Obviously, 
increased fluxes of energetic particles also are a risk to manned spaceflight. 

The disturbance of communication systems is not related to geomagnetic 
activity but is a direct consequence of the increased ionization in the iono- 
sphere due to the hard electromagnetic radiation (sudden ionospheric distur- 
bance SID) and to a lesser and spatially more limited extent also to energetic 
particles (polar cap absorptions PCA). Ground-ground communication with 
short and long waves is disturbed because these waves are no longer reflected 
by the ionosphere but are absorbed. Satellite-ground communication also can 
be affected due to enhanced absorption. Even if the signals pass through the 
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ionosphere they might be distorted or their travel time might be elongated, 
making navaids such as the GPS less reliable at times of high solar activity. 

The increased hard electromagnetic radiation during solar maximum con- 
ditions as well as in flares not only enhances ionization but also causes the 
atmosphere to expand outwards. Or, viewed from a fixed altitude, the den- 
sity and temperature increase with increasing solar activity. A satellite flying 
at low altitudes, typically a few hundred kilometers, an altitude used for 
instance by astrophysical and solar research satellites, then experiences an 
increased drag, reducing its speed and therefore also its altitude. In time, the 
satellite will move closer to the denser atmosphere and Anally plunge down. 
Prominent victims of increased solar activity have been Spacelab and the 
Solar Maximum Mission, SolarMax. 

And the Biosphere? While the influence of solar activity on our technical 
environment is undoubted, the influences of solar activity is discussed contro- 
versly. If there are any links, most likely the magnetosphere and geomagnetic 
activity should provide the connection because solar-activity-related enhance- 
ments in hard electromagnetic radiation are absorbed high in the atmosphere. 
Some influences are obvious or can be tested easily in the laboratory. Migrat- 
ing animals, such as pigeons and whales, use the geomagnetic held for ori- 
entation and navigation. They become disturbed during geomagnetic active 
periods. For instance, carrier pigeons launched in a magnetic storm need a 
much longer time to And their home (days compared with some hours) . Some 
micro-organisms seem to have a magnetic held sensor, too, using the held for 
their orientation. Laboratory experiments show that changes in the magnetic 
held can affect the vital functions of these organisms. 

The influence of geomagnetic disturbances on humans is under debate, 
too. Although we still might have a magnetic held sensor, as is suggested 
by some tests, our means of orientation and navigation rely much more on 
visual information. Thus most people probably would not even be able to 
deliberately access the information provided by such a held sensor. Nonethe- 
less, since currents are the carriers of information in the human body, the 
influences of geomagnetic activity should not be ruled out completely, see, 
for example, [261]. An often cited example is the higher incidence of cardio- 
vascular diseases, in particular cardiac infarction, during geomagnetic storms, 
which is suggested by different statistical studies at different places over the 
last 30 years. These correlations often have been doubted since the changes 
in the surface magnetic fleld are small compared with the magnetic flelds 
created by our technical environment; however, recent research suggests that 
the crucial factor probably is not the magnitude but the frequency on the 
fluctuations: it appears that geomagnetic disturbances have frequencies in 
the range of slightly less than and up to a few hertz (geomagnetic pulsations) 
which in principle can lock with the cardiac rhythm. If such locking occurs 
during the vulnerable phase, cardiac arrest may result. 
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Caveat. I want to caution the reader that the above discussion represents 
only a tiny glimpse of a small facet of the complex climate system and its 
connection with solar activity. The correlations and interpretations given here 
are examples of investigations and lines of thought, pointing to mechanisms 
that might contribute to a connection between solar activity and climate. 
The real phenomenon “climate system”, however, cannot be reduced to one 
parameter and a sole cause. And the tampering of humankind with the system 
Earth should not be ignored, too. 




Part III 



The Methods 
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Measurement began our might. 
W.B. Yeats, Last Poems 



Observations in the solar-terrestrial environment are made from the ground, 
such as magnetic field measurements, in situ from rockets and satellites, and 
by means of remote sensing, for instance radio sounding of the ionosphere. 
Measured parameters cover fields and their fiuct nations, plasmas, energetic 
particles, and electromagnetic radiation. This chapter gives a brief overview 
of the basic principles of field, plasma, and particle measurements. It does not 
show examples of instruments because each instrument is designed for a very 
special purpose and therefore has unique specifications. Collected papers on 
measurement techniques in space plasmas can be found in [407,408]. 

To study complex phenomena, not a single parameter but a set of param- 
eters has to be determined. To study the propagation of energetic particles, 
for instance, in addition to the particles the magnetic field and its fluctua- 
tions must be measured, too. Thus not only does each instrument have to 
be designed cleverly to fulfill the specifications for space-flight (such as small 
mass, low power consumption) but also the combination of instruments on 
the spacecraft or rocket has to be chosen to yield the maximum of informa- 
tion for the topic under study. Some special publications contain summaries 
of mission goals as well as detailed technical descriptions of the instruments, 
e.g. SOHO [161], Ulysses [531], Wind [565], Polar [414], and Galileo [180]; 
internet resources are listed in the Appendix under missions. 



11.1 Field Instruments 

The measurement of the electromagnetic field can divided into the measure- 
ment of the magnetic and the electric fields. Since the spacecraft itself gener- 
ates electric and magnetic fields, field sensors are always mounted on booms 
that extend up to some tens of meters from the spacecraft. These booms 
consist of thin strong wires which, during the launch of the spacecraft, were 
tightly wound on a reel. They are extended by the centrifugal force caused 
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be the spacecraft’s rotation. Thus the sensors can be at very long beams in a 
plane perpendicular to the spacecraft’s axis of rotation, while parallel to the 
spin axis only rather short, folded metal rods can be used as booms. This 
latter constraint holds for all axis if the spacecraft is not spinning. A modern 
electric field instrument, such as the one on board the Freja satellite, uses a 
set of six probes extended from booms with lengths between 5 and 15 m; the 
probes on the Viking satellite are mounted on 40 m long wire booms. 

11.1.1 The Magnetic Field 

The magnetic field is described by the magnetic field vector, i.e. the field 
direction and the fiux density. The measurements can be done either by ob- 
serving changes in the magnetic fiux, for instance in the pulsation or fiuxgate 
magnetometer, or by utilizing effects at the atomic level, such as proton 
precession or the Zeeman effect. These latter effects are used for absolute 
ground-based magnetic field measurements only. 

Pulsation Magnetometer. A pulsation magnetometer is a tri-axial ar- 
rangement of three coils. According to Faraday’s induction law (2.6), changes 
in the magnetic fiux through a surface cause an electromotoric force in its 
circumference. Thus the fiuctuating field causes an induction current in the 
coil which is proportional to the change in the magnetic fiux. The pulsa- 
tion magnetometer is used to measure magnetic field fiuctuations in three 
perpendicular directions with each coil giving one fluctuating magnetic field 
component. Since it cannot be used to determine the absolute fiux density, it 
is only used in ground-based observatories, preferentially at high geomagnetic 
latitudes, to give detailed records of magnetic field fiuctuations. 

Fiuxgate Magnetometer. Similar to the pulsation magnetometer, a fiux- 
gate magnetometer is a tri-axial arrangement of three sensors. Fiuxgate mag- 
netometers not only measure the magnetic field fluctuations but also the 
components of a constant field. However, a fiuxgate magnetometer is not an 
absolute instrument but has to be calibrated. 

The sensor is a small transformer, wound around a high-permeability core. 
Its primary winding is excited by a high frequency-current, typically some 
kilohertz. Current and permeability are chosen such as to drive the core 
to saturation during each half-cycle of excitation. The secondary winding 
then detects a time-varying voltage which is related to the input signal by 
the hysteresis curve of the core material. For a high-permeability core, these 
curves are very non-linear and the output signal is highly distorted, containing 
all higher harmonics of the input signal. If there was no external magnetic 
field along the axis of the transformer, the hysteresis loop would be traversed 
in a symmetric manner. In this case, only the odd harmonics of the input 
frequency would show up at the output. In the presence of a magnetic field, 
however, saturation is acquired earlier in one half-cycle of excitation than in 
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the other. This asymmetry adds even harmonics to the output signal. The 
amplitudes and phases of the even harmonics are then proportional to the 
field component parallel to the axis of the transformer. 

Since the odd harmonics are much stronger than the even ones, their 
signals must be eliminated. This can be done with the ring-core or race- 
track transformer: two cores, each with its own primary coil, are operated in 
parallel with a common secondary winding. The primary windings are excited 
by equal currents of opposing direction. Thus if the external magnetic field 
was zero, the output at the secondary coil would be zero, too, because the odd 
harmonics of both primaries cancel. If an external field is present, the odd 
harmonics still cancel, reducing the output signal to the desired quantities, 
the even harmonics. 

Alkali- Vapor Magnetometers. The alkali-vapour magnetometer utilizes 
the Zeeman effect, i.e. the splitting of atomic lines in the presence of a mag- 
netic field. The distance between these splitted lines is proportional to the 
magnetic field strength. 

The sensor consists of two cells filled with atomic vapor, for instance Rb, 
aligned with a photocell. As the first cell is heated, it emits light with fre- 
quencies corresponding to the transition between the different sublevels of 
the two lowest atomic levels. This light passes through a circularly polariz- 
ing filter into the second cell. Here it is absorbed, raising the electrons to 
specific sublevels of the next highest level. Falling back to the lower level, 
these electrons emit light. Since this light is circularly polarized, only certain 
transitions between different sublevels are allowed, and after a brief time all 
electrons will fill the highest sublevel of the lowest level. Since transitions 
from the highest sublevel are forbidden, no further absorption occurs. The 
photocell thus detects the following chain of events: as the first cell is heated, 
light is emitted and detected by the photocell. As absorption sets in inside the 
second cell, this light is reduced until finally all electrons occupy the highest 
sublevel and the full stream of light hits the photocell again. 

The second cell is surrounded by a coil which allows the transmission of 
a radio signal into the cell, redistributing the electrons to different sublevels. 
Thus the pumping process can be repeated. For redistribution to occur, the 
frequency of the radio signal must correspond to the frequency difference 
between the higher states. It is generated by using the output of the photocell 
to control the frequency of an oscillator. Thus the entire system oscillates 
between absorption during pumping and radiation when all electrons occupy 
the highest sublevel. The frequency of the radio signal required to maintain 
the oscillation depends on the frequency of the splitted lines and therefore is 
a measure for the magnetic field strength. 

The alkali-vapor magnetometer is an extremely fast and accurate instru- 
ment; however, it only gives the total field but not its components. 
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Proton-Precession Magnetometers. Proton-precession magnetometers, 
too, are absolute instruments measuring field strength only. They utilize the 
fact that the magnetic moment of the proton makes it a small bar magnet. 
Thus protons can be aligned field-parallel, for instance by putting a container 
with a liquid rich in protons into a coil through which a strong current passes. 
The protons than align along this field. If the current is switched off, they 
try to re-align to the external magnetic field, the one to be measured. Since 
the electrons rotate, they behave like small gyroscopes, precessing around the 
field line. This precession frequency is inversely proportional to the gyromag- 
netic ratio ^fp and proportional to the magnetic field. It can be determined 
from the magnetic effects of the precessing protons on the coil. 



11.1.2 Electric Field Measurements 

The basic idea in electric field measurement is simple: use two (or more) 
probes extending on booms from the spacecraft and measure the potential 
between them. But this method alone is not sufficient to determine the electric 
field because two effects modify this potential difference. First, the motion 
of the satellite through a magnetic field, which always is present in space 
plasmas, causes an electric vxB induction field. In the inner magnetosphere, 
this might be as large as 0.5 V/m. Since this is a well-defined effect, it can 
easily be corrected for, using either simultaneously performed magnetic field 
measurements or the field strength at the satellite’s position predicted from 
a magnetic field model. The second effect is more serious and has to be taken 
into account in instrument design: the probes (and the satellite) interact with 
the ambient plasma, and thus the potential measured at the probe does not 
reflect the electric field alone. Electrons and ions accumulate at the probe, 
forming a plasma sheet around it. If the probe, or part of it, is lit by sunlight, 
electrons also can be removed by photo-ionization. The sum of these currents 
gives the probe’s floating potential (see Fig. 11.1). This floating potential can 
be balanced by applying a negative bias current to the probe. 

An entirely different method for the electric field measurement utilizes 
the drift of electrons in a crossed electric and magnetic field (see Sect. 2.3.2). 
Such an instrument is far more complex; however, its measurements are not 




Fig. 11.1. Currents influencing the potential Vp 
measured by an electric field probe. Currents are 
due to photo-ionization of the probe (Iph) and 
flows of ions (/i) and electrons (/e) towards the 
probe. These latter currents create the floating 
potential, which has to be balanced by a bias cur- 
rent 7b 
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Fig. 11.2. Basic measurement princi- 
ple to determine the electric field from 
the drift of electrons 



influenced by the interaction between the spacecraft and the plasma. Such 
an electric held sensor consists of an electron gun producing electrons with 
a well-deflned energy, a system of coils to generate a homogeneous magnetic 
held perpendicular to the electron beam, and a detector with high spatial 
resolution. The latter is mounted perpendicular to the direction of electron 
injection close to the position of the electron gun (see Fig. 11.2). For a van- 
ishing electric held the electron performs one gyration and is intercepted 
by the detector. In the presence of an electric held, the gyro-orbit is not a 
closed cycle: the E x ^-drift (2.51) modifies the gyro-orbit and the electron 
is intercepted at some distance from the starting point. From this offset, the 
drift speed and the direction of drift can be determined. Combined with the 
prescribed electron speed and magnetic held strength, this gives the electric 
held. 



11.1.3 Wave Measurements 

To identify waves in space we have to measure the electric and magnetic 
fields, as well as variations in the plasma parameters, such as density and 
temperature. The maximum frequency which can be measured is determined 
by the temporal resolution of the field and plasma instruments. It should 
be sufficiently high to allow the identification of all waves of interest in the 
physical question. 

The measurement of waves in space plasmas is a difficult task. In a text- 
book, a wave is sinusoidal of indefinite length. Space plasmas consist of a 
mixture of different waves and fluctuations; the waves often can be bursty, 
localized wave packets, occasionally even solitary waves. In addition, the ob- 
server is not fixed with respect to the wave held but moves relative to it, 
either because of his own motion, which would be the case in the quiet mag- 
netosphere, or because the plasma and fields are swept over him, such as in 
interplanetary space or in the active magnetosphere. Thus the observed held 
and plasma variations consist of both temporal and spatial fluctuations. 

The detection of waves in space plasmas therefore requires plasma and 
held measurements with various sensors and high sampling rates. Multipoint 
measurements are advantageous. For small wavelengths, these can be realized 
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with antennas of various orientation and length. But even if very accurate, 
all these measurements give the fluctuations only. To identify the waves, a 
sound knowledge of wave physics and good guesses on the expected waves 
are required, too. Occasionally, particle measurements can be used to check 
the interpretation of the held fluctuations in terms of waves. One example 
was discussed in Sect. 7.4. 

Multipoint measurements to study linger waves require a large separa- 
tion between the individual instruments and therefore different satellites. 
This can be either done with mini-satellites carrying only filed instruments. 
These mini-satellites are launched with one carrier and deployed at different 
positions. While such mini-satellite projects still are under development, mul- 
tipoint measurements with conventional satellites also exist, for instance the 
Helios measurements (see Fig. 7.25 for an example concerning particles and 
shock waves). Tomographic measurements of the magnetosphere, which also 
include the study of waves, are performed by the four CLUSTER-II satellites 
(sci . esa. int/science-e/www/area/ index . cfm?f areaid=8) 



11.2 Plasma Instruments 

Plasma instruments are designed to measure the density, temperature, speed, 
and composition. They can be divided into two groups: in dense plasmas 
particle energies are low and the instruments measure the collective behavior 
of the plasma, while in rarefied plasmas individual particles are collected. 

11.2.1 Instruments for Dense Plasmas 

The plasma densities in the ionosphere are high, and thus plasma properties 
can be determined from the plasma’s collective behavior, such as currents. 
Rather simple probes and traps can be used to measure densities and tem- 
peratures. The measurement principle is based on a sensor extending into the 
plasma, and drawing electrons or ions from it, depending on the potential of 
the sensor. A trap utilizes the motion of the spacecraft relative to the plasma, 
like a snowplow piling up electrons and ions in front of it. 

Langmuir Probes. The basic plasma sensor is a Langmuir probe. It can 
even detect particles with very low energies by the current they carry. Fig- 
ure 11.3 gives the current-voltage characteristic of a Langmuir probe. The 
space potential Vs is used as a universal reference potential. Its meaning can 
be visualized as follows: imagine a large open- wire grid placed inside an ion- 
ized medium. Its potential relative to a large but remote body, such as Earth, 
can be varied. When the flow of electrons and ions through the holes in the 
grid no longer is modified by its presence, the grid has the space potential. 

Let us now substitute the grid by a plate at the same potential. Then 
electrons and ions are collected at its surface. The electron current is ngCUe, 
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Fig. 11.3. Voltage-current characteristic of a Lang- 
muir probe 



the ion current n\ev{. Assuming quasi-neutrality, we have n = Uq = n{. Thus 
the net current towards the plate is ne{v{ — Ve). Since the electron speed by 
far exceeds the proton speed, a negative current towards the plate results, 
representing the preferred electron current. If we now disconnect the plate 
from its original potential, it soon will be charged negatively up to the floating 
potential Vf. At this point the net flow vanishes and e{riiV{ — n^Ve) becomes 
zero. Note that now the densities of electrons and ions are different. Since 
Ve > V{ still holds, close to the plate the density of the electrons must be 
smaller than the one of the ions. Thus at the floating potential a negatively 
charged plate is surrounded by a cloud of positively charged ions, controlling 
the direction of the current towards the plate, comparable to the space charge 
in a diode. The thickness of the sheath is the Debye length. 

The physical principle of the Langmuir probe is thus based on the different 
masses and speeds of electrons and ions and the resulting dependence of the 
net current on the probe’s potential. The characteristic in Fig. 11.3 has three 
well-defined regions which can be utilized for measurement: 

(a) If the probe’s potential V exceeds the space potential V^, electrons 
are drawn towards the probe while ions are repulsed. Thus the current is 
determined by the electron density rie only. 

(b) If the probe’s potential is between the space potential and the floating 

potential, the current still is determined by the electrons. But only the faster 
electrons are collected at the probe since the retarding potential Vr = — 

V prevents the slower ones from reaching it. Thus the retarding potential 
determines the minimum kinetic energy an electron must have to be captured 
by the probe. Variation of allows us to determine the change in electron 
density with energy, yielding the electron distribution. Since the electron 
distribution is a Maxwellian, the temperature can be determined. 

(c) If the probe’s potential V is smaller than the floating potential Vf, 
electrons are repulsed while ions are collected and the ion density can be 
determined. Since the speed of the ions in general is smaller than the space- 
craft’s speed Vsc, the theoretical treatment is more difficult than for the elec- 
trons. The collection condition than becomes eVr < rriiv‘^^/2, allowing us to 
determine ion masses and abundances. 
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Retarding Potential Analyzer. The retarding potential analyzer basically 
is a Langmuir probe supplemented by some grids. Since with this configura- 
tion a more detailed analysis of the ions becomes possible, it is also called 
an ion trap. At the upper grid, all ions arrive with approximately the same 
speed, the speed of the satellite. The main purposes of the grids are to defiect 
the electrons and to screen the detector from potential changes on other grids. 
But one grid serves a special purpose. Its potential can be varied between 0 
and some tens of volts, defiecting also the lighter ions. Thus the ion compo- 
sition can be determined. The analysis of the current-voltage characteristics 
gives the ion temperatures as well. 

As well as retarding potential analyzers Langmuir probes can be built in 
different shapes, either planar, as discussed above, or spherical or cylindrical. 
The operating principle, however, is the same. 

Impedance and Resonance Probes. An entirely different approach is 
chosen in the impedance and resonance probes. These probes do not measure 
the plasma itself but the electrical properties of the medium, which then 
allow us to derive the plasma density from theory. 

In the impedance probe, the dependence of the dielectric constant e on the 
electron density and radio frequency is used. Thus the probe has to measure 
the dielectric constant in space. In the laboratory, the dielectric constant of a 
material is determined by placing it inside a capacitor and then measure the 
change in its capacity. The same principle is used in space plasmas. Here the 
capacitor is designed to be part of an oscillator. Since the frequency depends 
on the capacity, the electron density can be inferred from it. Impedance 
probes often are fiown on rockets. Since they are fast instruments, the allow 
for a very detailed analysis of the height profile of atmospheric ionization. 

Resonance probes are based on a tuneable transmitter and a receiver. 
There are characteristic frequencies at which the medium between the trans- 
mitter and the receiver starts to resonate. Plasma parameters than can be 
determined from these resonances. 



11.2.2 Instruments for Rarefied Plasmas 

We will now discuss plasma instruments that do not utilize collective effects 
of the plasma but detect individual particles. These instruments therefore 
also can be used in the rarefied plasmas in interplanetary space. 

Mass Spectrometer. A standard instrument for the analysis of a gaseous 
medium is the mass spectrum meter. Mass spectrometers in space obey the 
same principles as earth-bound ones, but have two advantages: first, in space 
a spectrometer does not need a vacuum system since above an altitude of 
about 100 km collisions have become negligible. Second, there is no need to 
ionize the sample because all matter is ionized. 
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Fig. 11.4. Basic principle of a time-of-flight spectrometer 



The basic principle of a mass spectrometer is the deflection of particles 
in a magnetic fleld by the Lorentz force. Using an acceleration voltage and a 
system of shutters, the particle speed can be prescribed. The deflection then 
depends on the particle’s mass-to-charge ratio m/q.ln the upper atmosphere, 
ions normally are singly ionized, and thus their mass can be determined. 

Mass spectrometers are only rarely used on spacecraft since the main part 
of the spectrometer is a large magnet, adding to the weight of the instrument. 
In addition, the high magnetic fluxes might disturb other instruments. 

Time-of-Flight Spectrometer. The time-of-flight spectrometer avoids the 
problems just outlined. The principle of a time-of-flight spectrometer is shown 
in Fig. 11.4. To reach the detector, the ions must pass through a series of 
grids, some of them biased by a steady potential, the central one having 
an oscillating potential. The potential drop Vi between grids 1 and 2 would 
accelerate the ion to a speed \/2q[Vi/mi which depends on the mass-to- 
charge ratio m/q. However, depending on their initial speed, the ions still 
have different speeds. The oscillating potential at grid 3 only accelerates ions 
with speeds close to 2d/T, with d being the distance between grid 2 and 3 and 
T being the oscillation period. Grid 5 now is used as a retarding electrode, 
preventing all ions that have not acquired the maximum possible speed from 
reaching the detector. Thus the ion beam passing grid 2 is velocity Altered 
and only ions with a certain mass-to-charge ratio reach the detector. 

Time-of-flight spectrometers often are combined with electrostatic deflec- 
tion systems. The flight path then more closely resembles the one in a mass 
spectrometer. One example of an extremely complex, but very advanced, 
combination of different sensors is CELIAS, the Charge Element and Isotope 
Analysis System on board SOHO [161]. 

Modern time-of-flight spectrometers are used to measure ions with ener- 
gies up to some MeV/nucl. Thus the instruments not only measure the plasma 
composition but also energetic particles. However, only the lower end of the 
energetic particle population can be analyzed in a time-of-flight spectrome- 
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ter since the higher energies would require longer travel paths and therefore 
larger instruments. 



11.2.3 Energetic Particle Instruments 

The measurement principle for energetic particles is based on their interaction 
with matter. Energy losses are due to ionization, bremsstrahlung, Cerenkov 
radiation, or nuclear interactions [514]. The energy loss inside the detector can 
be measured, giving us clues to the particle’s properties. The most commonly 
used particle detector is a semiconductor. 

If an energetic charged particle traverses matter, electrons will be shifted 
to higher energy levels (excitation) or will be removed from the atom (ion- 
ization). The energy loss of the energetic particle is described by the Bethe- 
Bloch equation (10.3) and depends on the particle’s parameters, i.e. charge 
and speed, but not on the particle’s mass. Thus with only one detector, parti- 
cles cannot be identified. Instead, a particle telescope is used, consisting of a 
stack of detectors surrounded by an anticoincidence detector (see Fig. 11.5). 
The anticoincidence detector serves two purposes: together with the upper 
two detectors, it defines an entrance aperture, i.e. the solid angle out of which 
particles can be detected. Particles coming from outside this angle might also 
hit the telescope and lose energy, for instance in detectors D2 and D3. But 
these particles can be singled out since they also have produced a signal in 
the anticoincidence detector. Since the anticoincidence detector only needs 
to detect the particle without determining its energy loss, often scintillation 
counters are used. 

The information obtained with such a particle telescope is the energy loss 
dE/dx for each detector and the residual energy dE in the detector where 
the particle comes to rest. Combined, this gives the particle’s energy, which 
of course could also have been determined with a single sufficiently thick 
detector. But the use of a stack of detectors also allows the identification of 
the particle. An initial separation between different particle species can be 
done with a suitable choice of the first detector Dl. The thickness of this 
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Fig. 11.5. Basic principle of a particle tele- 
scope: a stack of detectors D is surrounded by 
an anticoincidence detector A 
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Fig. 11.6. Pulse height matrix for a 
particle instrument 



detector can be chosen such that even a slow electron loses less energy than 
a fast proton and even a slow proton loses less energy than a fast a particle. 
Thus, by defining certain thresholds for the energy loss, electrons, protons, 
and heavier ions can be distinguished. This is a crude but simple method 
and can be used in onboard data analysis to assign the particles to energy 
channels. A more careful analysis can be based on a combination of the energy 
losses in individual detectors in a pulse height matrix. Here, even different 
heavier ions and isotopes such as ^He and ^He can be distinguished. 

A sketch of a pulse height matrix is shown in Fig. 11.6. All particles 
that stop in detector D3 are considered, and the specific energy loss AE/dx 
in D2 is plotted versus the residual energy in D3. The dots represent entries 
from individual particles; the lines give the loci expected for different particle 
species. Let us start with the locus for protons. If the proton has a rather 
low energy and just manages to make a signal in D3, its energy loss in D3 is 
small. In addition, it was already rather slow in D2 and thus has experienced 
a rather large specific energy loss there. Such a proton therefore sits at the left 
end of the locus. A faster proton which is stopped just before leaving D3, on 
the other hand, must have been rather fast in D2. It therefore has experienced 
a smaller specific energy loss in D2 but has deposited a rather large residual 
energy in D3. It is therefore located at the right end of the locus. Other 
ions show the same pattern; however, depending on Z, the energy losses are 
higher and the curves are therefore shifted upwards. Two examples, ^He and 
^He, are indicated. The electrons cluster in the lower left corner: owing to the 
lower energy required to reach D3, their energy losses and residual energies 
are small. 

The dots representing individual particles show a scatter around the locus. 
The reasons are twofold, (a) Ionization is a stochastic process, and not a 
continuous energy loss as suggested by the Bethe-Bloch equation. The Bethe- 
Bloch equation therefore gives only an average energy loss, around which the 
real losses scatter, (b) The locus is calculated for a particle moving along the 
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detector axis, that is, an angle of incidence of 0°. Particles in fact hit the 
detector at different angles. If they cross the detector at a rather large angle, 
their travel path in the detector material is longer and, consequently, their 
energy loss is larger. This gives an additional deviation from the idealized 
locus. 

As a consequence, in the example in Fig. 11.6, a clear separation of ^He 
and ^He is not possible. Of the two effects leading to the scatter, we cannot 
influence the statistical nature of the energy loss process. But we could correct 
for different angles of incidence by collecting particles with different angles 
of incidence in different pulse height matrices. To do this, we must know 
the angle of incidence, that is, we have to know the places in which the 
particle hits D1 and D2. These positions can be obtained by using sectorized 
detectors. 

But even the separation of electrons and protons is not always unam- 
biguous. The ionization of an atom by a proton requires the interaction of 
a heavy particle (the incident proton) and a particle of much smaller mass 
(an electron in an atomic shell). During this interaction the travel direction 
of the incident particle is not changed significantly; the proton flies straight 
ahead. If the incident particle is an electron, two particles of equal mass inter- 
act, and the electron is deflected from its original travel path. The resulting 
zigzag path leads to larger energy losses than does a straight travel path. 
Since the deflection angles are distributed stochastically, a broad scatter in 
energy losses results: the cloud in the lower left corner in Fig. 11.6. 

If the particles are identified from the pulse height matrices, this multi- 
ple scattering does not pose a problem in the discrimination of electrons and 
ions. However, since often not all pulse height words are stored and processed 
on board, in many instruments the energy loss in D1 is used to discriminate 
electrons and protons: electron energy losses are below a certain threshold, 
proton losses above that value. However, the threshold is determined for a 
straight travel path: owing to multiple scattering, an electron might lose more 
energy than a minimally ionizing proton following a straight path and there- 
fore might be identified as a proton. This can be circumvented by inserting 
a detector element into the telescope that uniquely identifies the electrons. 
For a similar range in matter (that is, the same stopping detector), electrons 
must be relativistic, while ions are non-relativistic. Thus any detector that 
can identify relativistic particles, such as a Cerenkov detector, can be in- 
serted into the stack in our sample detector in Fig. 11.5 and will provide a 
clear separation between electrons (which produce Cerenkov radiation) and 
protons (which do not). 

Since particle telescopes have a well-defined aperture, they can be used 
to determine the angular distributions of energetic particles too. On an axis- 
stabilized spacecraft, this can be done only by combining instruments looking 
in different directions. On a spinning spacecraft, the situation is simpler: if 
mounted perpendicular to the spin axis, the telescope scans all directions in a 
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plane perpendicular to the rotation axis during one rotation. Thus sectorized 
data can be obtained, which in turn allow us to determine the anisotropy. 

The illustration of a telescope in Fig. 11.5 is only a very simple sketch. The 
thickness and number of detectors determine the maximum particle energy to 
be measured. For higher energies, either more detectors have to be added or 
absorbers can be placed between the detectors. If these are passive absorbers, 
there will be a gap in the energy spectrum obtained by the instrument. Active 
absorbers, on the other hand, do not necessarily give the energy loss exactly, 
but at least some information about whether the particle has stopped in the 
last detector before the absorber or whether it has managed to pass into 
the absorber and has stopped there. Thus a continuous energy spectrum 
extending over a larger energy range can be obtained. 



11.3 Supplementary Ground-Based Observations 

Historically, ground-based observations are the oldest means for observing 
geospace. Although today rockets and satellites are attractive means to study 
the terrestrial environment in situ, ground-based observations nonetheless 
still are important, basically for two reasons. First, in situ observations are 
one-point measurements, giving extremely accurate data for one isolated 
spot without considering its surroundings. Ground-based observatories, on 
the other hand, often give a more regional overview or combined even the 
global picture. For some questions, such as the extent and strength of geo- 
magnetic disturbances, this global information is more useful then the in situ 
observation at one spot only. Thus ground-based observations often can be 
used to complement in situ measurements. In addition, ground-based mea- 
surements in many cases can provide a continuous baseline which allows the 
study of long-term variations. Second, ground-based measurements can pro- 
vide access to regions which cannot be covered by in situ measurements, in 
particular the lower ionosphere up to a few hundred kilometer altitude where 
the atmospheric density is still to high to allow for long-living satellites. 

Ground-based measurements comprise the magnetic field measurements, 
which in turn give information about the ionospheric currents, optical obser- 
vations of the aurora, and measurements of ionospheric parameters by radio 
sounding and radar. Instruments for magnetic field measurements have al- 
ready been described at the beginning of this chapter; the principle of radio 
sounding was discussed in Sect. 4.5. Auroral observations from the ground 
are performed with different cameras, in particular all-sky cameras showing 
the evolution of the aurora during a geomagnetic storm. 

The most comprehensive information on ionospheric parameters can be 
obtained by radar. The basic principle in ionospheric radar is the same as 
used in radar on ships and airports: a radio wave is scattered back by the 
object of interest. During this scattering process the properties of the wave 
change, depending on the properties of the scattering object /medium. Radar 
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can be operated in two modes: in a monostatic radar, the transmitter and the 
receiver are located at the same position; in bistatic mode, the transmitter 
and the receiver are located at different positions. In the latter case the 
two antenna beams define the scattering volume while in the former case 
the scattering volume is defined by the transmitter modulation in the range 
direction. 

In the first instance, the refiected signal is used to derive quantities de- 
scribing the spectrum, such as width, power, or Doppler shift. From know- 
ledge of how the properties of the medium infiuence the spectral shape, the 
parameters of the medium, such as densities, temperatures, or velocities, can 
be inferred. With the EISCAT radar system (www. eiscat . c om/ about .ht ml) 
located in northern Scandinavia, the following plasma parameters can be 
measured: electron density, temperature, and velocity, ion temperature and 
velocity, ion mass, the relative number of negative ions, the gyro-frequency 
of ions, and the ion neutral collision frequency. From these parameters, other 
parameters can be deduced, for instance the electric field, the velocities of 
the neutral component, the ion composition, the current density, the energy 
deposition E j, and the particle’s energy. Although this list is impressive, 
we should be aware of the fact that data are noisy and imperfect and thus 
assumptions have to be made about some of the parameters, allowing us to 
infer the others. For instance, temperature and collision frequency cannot be 
fitted independently. Instead, for heights below about 105 km, assumptions 
about the temperature are made and the collision frequency can be inferred, 
while at higher altitudes the modeled collision frequencies are used to infer 
the temperature. 
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To see a World in a Grain of Sand, 
and a Heaven in a Wild Flower, 
hold Infinity in the palm of your hand 
and Eternity in an hour. 
W. Blake, Auguries of Innocence 



Space physics is the application of physical concepts to a complex and highly 
variable system of environments: the Sun, the interplanetary medium, the 
planets, and the interactions between them. Space physics is therefore differ- 
ent from laboratory physics - it asks different questions, uses other methods, 
and builds models on a different basis. In other words, space physics has a dif- 
ferent view of a physical system from that which laboratory physics has. This 
last chapter is not meant as a comprehensive overview of scientific method- 
ology or philosophy. Instead, it just gives hints about some of the methods of 
space physics which might be worth to be evaluated a little bit further. Thus 
this chapter is meant to help one think about the facts of space physics as 
presented in the other chapters, as well as to think about thinking. 



12.1 Physics in a Complex Environment 

At school, and often even at university level, physics is taught as a set of laws 
that can be applied to more or less complex situations - mostly in laboratory 
experiments, sometimes also to real-world phenomena. Thus physics often 
evokes the impression of certainty: ask a question and either there is (exactly) 
one correct answer to it or there is a measurement method to answer it. But 
that is not what physics is concerned with: physics aims to understand basic 
concepts and principles - not details or numbers. 

A complex environment is different from laboratory physics: there is nei- 
ther a simple measurement nor a simple answer. Instead, a complex environ- 
ment is determined by a multitude of different parameters with non-linear 
interactions between them. Thus a simple input-output model, with a corre- 
spondingly straightforward laboratory experiment, cannot be developed. 
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Instead, in a complex environment we must retreat further to the original 
methods of science. For instance, as Popper suggests, the scientific method 
relies on three ideas: 

• a hypothesis can in principle be falsified, 

• a hypothesis must be based on objective tests, and 

• the results must be repeatable. 

The development of scientific ideas and concepts thus is an interplay be- 
tween model, observation or experiment, and predictions. Although model 
and experiment together can greatly contribute to the development of sci- 
ence, on their own they are at risk of being caught in an infinite circle: only 
the application of model predictions to independent observations can provide 
a test of the model, because the model-experiment combination is at risk of 
selective perception: “we see only what we want to see” . On the other hand, 
controlled experimental settings ensure the repeatability of results. 

The natural environment infiuences all aspects of the scientific method: 

• The measurement problem: in a complex natural system we are observers 
not experimenters. In a laboratory experiment, we modify one parameter 
and study the reaction of the experiment. In space, as in any complex natu- 
ral system, nature continuously changes a lot of parameters simultaneously 
(and in most cases without telling us explicitly) and the system adjusts ac- 
cordingly: each time we measure, we measure in a different system. 

• The data interpretation problem: we observe a phenomenon and some re- 
lated parameters while nature performs the experiment. But we do not 
always know which parameters are relevant and which not and we nor- 
mally observe only a few parameters, not all. 

• The modeling problem: a complex system is often formally described by 
a set of coupled partial differential equations that can be solved only nu- 
merically. In addition, many of the parameters in these equations are not 
constants but can vary over a broad range, often orders of magnitude. 



12.2 The Measurement Problem 

Regarding measurement and experimental methods, space physics can be 
likened to other fields of geophysics, such as earthquake and volcano research: 
it is observational science, not experimental science. We can “order” neither 
a magnitude 7 quake at a certain fault nor a gradual 7 -ray fiare with a 
large CME at the western limb of the Sun. Instead, we observe the quake or 
the fiare. And although we observe the phenomenon, nature does not easily 
supply all the additional information required, such as the exact location, 
magnitude, and kind of the energy release or other supplementary parameters. 

And we cannot even repeat the experiment. Although San Francisco will 
in time experience another strong earthquake, it will never be an exact repe- 
tition of the 1906 quake even if the magnitude and the location of the seismic 
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focus are almost identical: the 1906 quake has already changed the config- 
uration of plates that led to its occurrence, and that configuration will not 
return. The same is true for fiares and the magnetic field configuration in the 
photosphere and corona. 

Some of the measurement problems in space physics include: 

• Noisy data: even a simple instrument, such as a thermometer in atmo- 
spheric research or an electric-field instrument in space research, is influ- 
enced by its surroundings. 

• Incomplete data: 

• incomplete time series, 

• one-point observations, 

• unknown/unobserved parameters. 

• Shifts in instrument response with time. 

Note that in other fields of geophysical research, including atmospheric and 
oceanographic science, the measurement problems are quite similar. 

The sources of noise in the data are manifold. There are instrumental 
sources (heating due to solar radiation, and drifts due to aging of compo- 
nents) and errors in the transmission (the data are transmitted through the 
ionosphere, and thus, in particular during a flare and the accompanying sud- 
den ionospheric disturbance, transmission errors might occur). However, the 
data are also noisy in a different sense: parameters that in laboratory physics 
could be regarded as constants are anything but constant, as will be discussed 
in the next section. 

There are also a number of problems in data acquisition. Some of them 
already have been addressed in the “What I Did Not Tell You” sections. The 
most important constraint is the limited number of measurements, with the 
limitation showing up in three respects. Most quantities in the natural en- 
vironment are time-dependent fields, while most observations are one-point 
measurements only. This field character applies not only for the quantities 
are which classically regarded as fields, such as the electromagnetic field, but 
also for quantities which are normally treated more like material parameters 
such the diffusion coefficient, which is a transport parameter. Thus our ob- 
servation might be as representative as that of an alien space probe that in 
an attempt to study the Earth’s atmosphere was unfortunately dropped into 
the sulphurous ash cloud emitted by an erupting volcano. 

To avoid such bad luck, repeated measurements are helpful because they 
at least allow one to determine average parameters. In space science, this 
repeated measurement is basically the time series obtained by an instrument 
during its lifetime of many years, often even many ten years. In that case, we 
face the long-term evolution of both the observables of and the performance 
of the instrument: we cannot exclude drifts and degeneration in the detectors 
and electronics which might lead to systematic or stochastic errors in the ag- 
ing instrument. For long-term analysis, a time series is often constructed from 
a sequence of observations from different spacecraft of one family, such as the 
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IMP satellites or the GOES satellites. Although the instrument specification 
is often identical, the actual instrument performance might be different. Thus 
the time series is not necessarily homogeneous. It should be noted that most 
terrestrial long-term time series used for correlation, such as those for surface 
temperature, suffer from the same problem; see for example [41]. 

In addition, owing to malfunction of instruments or problems in data 
transmission, time series are often incomplete. Thus if we want to work with 
them, besides the data we need the metadata which contain the information 
about the data, the measurement principle, and the measurement practice. 

The above incompleteness of data is basically a measurement problem: 
with more instruments at more positions in space, our measurements could 
take account of the field character of the observed quantities and would help 
to distinguish between spatial and temporal variations of a given quantity. 
But there is another kind of incompleteness in the data: although a satellite 
is designed to measure many different parameters related to a problem under 
study, not all relevant parameters are measured, simply because the problem 
is too complex and our material constants are not exactly constants. 



12.3 The Data Interpretation Problem 

Data analysis in laboratory experiments often involves a comparison between 
two, occasionally more, measured quantities. Often a correlation analysis is 
used: we measure a distance s travelled by a falling ball and the time t 
required for the fall, plot one against the other, try to find a formal relation 
between them, namely s = gt‘^12^ and determine the relevant constant g. 

In principle, we do the same with measurements in a natural environment. 
However, the situation is different in that normally each measured parameter 
depends on many other parameters. Thus we never get a clean correlation 
such as that in the above laboratory experiment. The situation is more com- 
parable to the same experimental setting of a falling body, but using a feather 
or leaf outdoors on a day when a gusty wind is blowing: we shall probably still 
find some indication that with increasing travelled distance the travel time 
increases, but neither will there be a simple formal relation (merely because 
the wind is not constant, the travel time depends on the angle at which the 
body starts to fall) nor will we be able to determine a constant which can 
be applied universally in the relation between s and t. At best, we might be 
able to give some average 

Thus, even if data are not noisy, we shall not be able to find simple, 
clear correlations. Nonetheless, as in the above example, we shall probably 
find that there is some relation between the parameters and try to prove 
this formally by using some kind of correlation analysis; for methods see for 
example [419]. As a measure of the quality of the correlation, we normally 
state some level of significance, for instance “more than 95%” , which means 
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that the chance that the correlation is accidental is smaller than 5%. This 
latter number is also called the error probability. 

In space physics, such a correlation analysis can be applied to many sets of 
data. For instance, we can correlate the intensities in ESP events with prop- 
erties of the accompanying shock or correlate the intensities of SEPs with 
the 7-emission of the parent flares. In both cases, correlation analysis makes 
sense, because ESPs are accelerated at the shock and particles accelerated in 
flares partly interact with the Sun, producing 7-emission, and partly escape, 
being detected as SEPs. Thus we expect some physical relation between the 
relevant variables: since other parameters, for instance interplanetary trans- 
port, might also influence particle measurements in space, the correlation 
most likely will look more like that for the falling feather than that obtained 
in a well-deflned laboratory experiment. But on physical grounds, we can 
nevertheless expect these parameters to be correlated. And if the correlation 
is at a 95% level (which, with a large data set, can be obtained even if the 
scatter of the data is strong), we feel comfortable with the correlation because 
we can understand it in physical terms and it supports our model. 

The situation becomes more complicated if we lack a good model, as is 
the case in many attempts to study solar-terrestrial relationships. There is a 
tendency to correlate one indicator of solar activity, for instance the sunspot 
number, with almost any parameter in the terrestrial environment, such as 
sea surface temperature, vorticity index, drought index, wheat prices, alcohol 
consumption, the Dow-Jones index, and many more. Let us assume that a 
scientist has access to many different databases and manages to do 100 cor- 
relations. The scientist is happy to And flve correlations with confldence level 
95% (or error probability less than 5%) and, although he/she has absolutely 
no idea why these parameters should correlate with the sunspot number, 
he/she writes a paper on these flve correlations and submits it. The first 
referee is impressed by five correlations, each at a 95% confidence level, and 
recommends publication. The second referee is more sceptical, asks the au- 
thor for the number of parameters tested, and, on learning about the 100 
attempts to find a correlation, rejects the paper. The second referee obvi- 
ously has a better understanding of probabilities and significance: if I do one 
correlation and get an error probability below 5%, this is fine. If I do five 
correlations, each with an error probability below 5%, this is also fine. But if 
I do 100 correlations, out of which five have an error probability below 5%, 
I have probably just found the accidental ones: the error probability of 5% 
also means that, on average, 5% of my correlations are positive by chance. 
Therefore reports about correlations without any idea about a reasonable 
relation between the parameters should be treated with caution. 

In solar-terrestrial relationships, two time series are often compared, e.g. 
Figs. 7.30 and 8.42. If the data stretch is short, for instance 4 years in the de- 
clining phase of the solar cycle, the sunspot number will correlate (or anticor- 
relate) with almost any parameter which shows a steady temporal evolution. 
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Fig. 12.1. Correlation be- 
tween pairs of breeding storks 
{open symbols) and millions of 
newborn babies {closed sym- 
bols). Reprinted from H. Sies 
[483], Nature 332, Copyright 
1998, with kind permission 
Nature Publishing Group 



Although the chance of an accidental correlation decreases with increasing 
length of the data stretch, we still have a remaining risk and, if we make 
enough correlations, will shall find one by chance. 

However, again we feel more comfortable with such a correlation between 
time series if we expect a correlation from our understanding of the system. 
A good example of such a correlation has been published in Nature and is 
reprinted in Fig. 12.1. The author, Helmut Sies [483], notes: “There is concern 
in West Germany over the falling birth rate. The accompanying graph might 
suggest a solution that every child knows makes sense.” But the author has 
not made full use of all the information contained in that correlation. Beck- 
Bornholdt and Dubben [38] have noticed that the time lag between the curves 
also shows that storks start their work only at the age of two, and not earlier. 

The above examples are extreme. They should caution the reader: sci- 
ence in a complex environment is often at risk of being fooled or of fooling 
itself by misunderstood applications of basic statistical concepts. Part of this 
reflects the desire of mankind to describe its environment, often by use of 
grouping and categorizing. But we should limit correlations, grouping, and 
categorizing to parameters and systems where they can be applied reason- 
ably. Benestad [41] refers to a comment by D.B. Stephenson (University of 
Reading): “We have grouped stars that we know are randomly distributed 
into stellar constellations which we have named.” Again this should serve to 
caution the reader. 

Not only does data analysis face challenges not normally encountered in 
laboratory experiments, but the results often also have a different meaning. 
Again, take the falling-body experiment. In the laboratory experiment, we 
do the experiment to determine a constant, the gravitational acceleration g. 
With the falling feather in windy weather, there is no constant that describes 
the motion, but some kind of acceleration parameter that consists of the 
combined effects of gravitational acceleration and drag, and varies depending 
on the variations in the wind and therefore in the drag. The situation is similar 
in many respects to space physics. For instance, if we study interplanetary 
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transport, formally the diffusion coefficient in the transport equation can be 
regarded as some kind of material constant, similar to the index of refraction. 
But if we remind ourselves of the nature of the transport process, the pitch- 
angle scattering in wave fields, we realize that the diffusion coefficient depends 
on the ever-changing properties of the wave field and also on the particle 
properties - it is more similar to the of the falling feather than to the g 
of the laboratory experiment. 

For the scientist interested in interplanetary propagation, this makes sci- 
ence interesting. For the researcher interested in modulation studies or shock 
acceleration, however, this makes science difficult because here it would be 
much simpler if the diffusion coefficient could be treated as a constant ma- 
terial parameter and not as a quantity that varies in time and space. Thus 
estimates such as that of the characteristic acceleration time in diffusive shock 
acceleration in example 23 have to be treated with caution: although for av- 
erage scattering conditions, the estimate suggests that the acceleration of 
10 MeV protons is rather slow, this does not exclude the possibility that 
there are a few shocks in which this process works nicely and extremely fast 
because scattering is much stronger. 



12.4 The Modeling Problem 

In physics, a model is formulated in terms of causes and consequences: the 
electric field exerts a force on an electron and accelerates it. The ideas of cause 
and consequence also hold in the physics of natural environments, the only 
difference being that one cause can have many consequences and one conse- 
quence is a result of many causes: we cannot reduce the system to just one 
cause-consequence chain as we would attempt to do in a laboratory experi- 
ment. Consequently, models are more complex and, formally, often consist of 
sets of coupled partial differential equations. Although analytical approxima- 
tions are valuable for understanding the characteristics of the solutions and 
for obtaining an idea of the dependence of the solution on various parameters, 
most models rely on numerical solutions. 

Thus a physical model in space science is developed from physical ideas 
and observations, often in the form of correlations. Its formal treatment and 
its test require numerical simulations. As mentioned above, these simulations 
often require assumptions about certain parameters, such as the diffusion 
coefficient. Therefore, these numerical models are often used for parameter 
studies to understand the influence of the variation of each parameter on the 
solution. This approach is similar to the one normally chosen in laboratory 
physics: keep all parameters constant and well defined and study the reaction 
of the system to just one parameter. Therefore, in the case of a natural 
system, laboratory experiments are often replaced by numerical experiments. 
An example is shown in Fig. 7.12, which studies the energy dependence of 
the effects of the solar wind in interplanetary transport for a fixed solar 
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wind, a fixed observer’s position, fixed particle injection, fixed interplanetary 
propagation conditions, etc. 

As with laboratory experiments, we must be careful that our numeri- 
cal instruments do not lose touch with the natural system. Therefore, we 
have to test our simulations. This, basically, can be done in two ways. The 
interplanetary-transport equation, for instance, has been applied to a large 
number of particle events; one example is shown in Fig. 7.13. The fact that 
the model is able to fit the observations does not prove the model, but at least 
it increases our confidence in it. Thus the confidence gained from multiple 
successful application of a model to data is comparable to the reproducibility 
of laboratory instruments. 

In addition, the validity of a model can be assessed by its ability to make 
correct predictions. For instance, it was recognized early that with average 
parameters, diffusive shock acceleration is too slow for a large energy gain. It 
was also recognized rather early that particles streaming away from the shock 
might generate waves which would increase the acceleration efficiency. Thus 
the prediction was: if you see particles upstream of the shock, you should find 
waves too. A closer look at the data confirmed this. 

Another example of prediction and, in this case, the disproof of a model 
is the following. Originally, the heliosphere was assumed to be rather irreg- 
ular in the lower latitudes above the streamer belts, because here different 
solar wind streams interact and transient disturbances (shocks and CMEs) 
constantly modify the structure. Over the poles, on the other hand, there is 
no activity, and the solar wind out of the coronal hole is fast and contains 
almost no turbulence. With only a small amount of turbulence, there is not 
much scattering of particles and, in particular, galactic cosmic rays should 
have easy access over the poles of the Sun, similarly to the solar energetic 
particles precipitating at the cusps of a planetary magnetosphere. Therefore, 
a mission over the poles of the Sun should allow one to observe galactic cosmic 
rays without too much modulation by the heliosphere, and thus allow one to 
determine the local interstellar spectrum more accurately. As a consequence 
of this prediction, Ulysses was designed. Once it was over the poles, the ob- 
servations disproved our previous concepts of the structure of the heliosphere 
and the physics of modulation. 



12.5 Is This Still Physics? 

I am working in a physics department with colleagues all of whom do labora- 
tory physics or theoretical physics. During seminars and colloquia, sometimes 
the question arises: is this still physics? All these uncertainties, the unknown 
parameters, the incompleteness of observations. Can this be physics? 

Yes, of course it is physics. It is the application of physical concepts to 
a natural system. Space physics works according to the same principles of 
science as laboratory physics does: models, observations, repeatability, and 
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prediction. The only difference is the broader scope, the larger number of 
parameters, and the broader scatter in them. It is sometimes difficult for a 
laboratory physicist to grasp that an order-of-magnitude estimate in space 
physics may be quite good while he/she fights to determine some other con- 
stant with an accuracy of less than one per thousand. 

All these “problems” and uncertainties in space physics, however, also 
make it an attractive topic to study or to do research in it: like any work in a 
complex environment, it is an intellectual challenge. To understand a simple 
phenomenon such as a flare, a large number of scenarios may be developed, 
and with each new observation another piece of a jigsaw puzzle is provided 
- which might fit into its own picture or into an entirely different one. 

For whatever reason you have read this book, it may be study, research, 
or just interest, I wish you a lot of fun and success in this intellectually 
challenging field of space physics. 
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A.l List of Symbols 

a pitch angle 

/? plasma-/? 

7 spectral index 

7 Lorentz factor 7 = — v‘^/c^ 

7a adiabatic exponent, Cp/cy 

T] (dynamic) viscosity 

ip electric potential 

^ magnetic flux 

/^(//) pitch angle diffusion coefficient 

A mean free path 

A wave length 

Ac Coulomb logarithm Ac = In yl 

Ad Debye length 

Aoe electron Debye length 

fji magnetic moment 

z/ kinematic viscosity 

u frequency 

Pc collision frequency 

Pei electron-ion collision frequency 

u) angular frequency 

Q angular speed 

n solid angle 

LUc cyclotron frequency 

LOce electron cyclotron frequency 

cdcp proton cyclotron frequency 

oup plasma frequency 

a;pe electron plasma frequency 

Upp proton plasma frequency 

'ip azimuthal angle in a circular motion 

^ stream function 

p mass density 

pc charge density 
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Pe electron density 

a conductivity 

G scattering cross-section 

r collision time 

0Bn angle between shock normal and magnetic field direction 

X thermal conductivity 

X degree of ionization 

A area 

A vector potential 

B magnetic field 

Bmp magnetic field at the mirror point 

c speed of light 

Cp specific heat at constant pressure 

cv specific heat at constant volume 

dmin minimum distance of charges in a plasma 

D spatial diffusion coefficient 

D diffusion tensor 

E electric field 

Ebo energy of the ground state (Bohr model) 

Eel electrostatic energy 

Bp Fermi energy 

Brel relativistic electron energy 

Bth thermal energy 

e unit vector 

F force 

f force density 

f{r^v^t) distribution function (phase space density) 
g gravitation 

G universal constant of gravitation 

h Planck’s constant 

j current density 

jD drift current density 

J differential flux 

Ji action integral 

k wave vector 

k wave number 

/cb Boltzmann’s constant 

L characteristic length scale 

I vector along a path 

m mass 

rUe electron mass 

rrip proton mass 

n number density 

Tie number density of electrons 
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77-p 


number density of protons 


P 


momentum 


Pi 


generalized momentum 


P 


pressure 


Pm 


magnetic pressure 


P 


stress tensor 


P 


rigidity 


Q 


charge 


Qi 


generalized spatial coordinate 


r 


radius vector 


Tc 


radius of curvature 


Th 


Larmor radius 


The 


electron Larmor radius 


Ru 


magnetic Reynolds number 


Rwp 


mirror ratio 


ro 


impact parameter 


S 


vector defining a surface 


s 


path length (along the Archimedean spiral) 


Sp 


Poynting vector 


T 


temperature 


tp 


tangent vector to the magnetic field 


Te 


electron temperature 


Tc 


gyration time (cyclotron period) 


u 


bulk velocity 


U 


particle flux 


t^sowi 


solar wind speed 


V 


velocity of a particle 


v± 


speed perpendicular to the field 


n 


speed parallel to the field 


VA 


Alfven speed 


vd 


drift velocity 




electron speed 




group speed 


Vgc 


velocity of the guiding center 


Vp 


proton speed 


fph 


phase speed 




sound speed 


^th 


thermal speed 


t^th,e 


thermal speed of electrons 


Wkin 


kinetic energy 


^^kin,|| 


kinetic energy parallel to the field 


VFkin,± 


kinetic energy perpendicular to the field 


W^th 


thermal energy 




atomic number 
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A. 2 Equations in the SI and cgs System 



Maxwell’s equations: 



V • E — Qc/f-O 


<] 

II 

n 


(A.l) 


V-B = 0 


V-B = 0 


(A.2) 


r-7 r. 9B 


^ ^ ^ dB 

V xE = — — 
c at 


(A.3) 


„ „ , 1 8E 

VxB = moJ + ^-^ 


„ „ 47t . IdE 

V X B = —j + - — 
c c at 


(A.4) 


Field transformations: 






E' = E -\-v X B 


E' = E + -V X B 
c 


(A.5) 


B' = B - X E 

& 


B' = B--vxE 
c 


(A.6) 


Ohm’s law: 






2 = (j{E xB) 


j = a ^E + -V X B^ 


(A.7) 


Energy densities and fluxes: 






52 

= T. 

2iio 




(A.8) 


eoE^ 




(A.9) 


/^O 


^^ExB 

47T 


(A.IO) 


Single particle motion: 






F = q{E + v X B) 


^ VXB\ 

F = ,^E+ ^ j 


(A.ll) 


qB 

(jOc = 

m 


qB 

LOc — 

me 


(A.12) 


mv± 

~ \q\B 


mv±c 

k\B 


(A.13) 


p=^ 

9 


p = ^ 
Q 


(A. 14) 


Fx B 

«Dnft- ^^2 


c F X B 

'^Drift — j-^n 

q 


(A.15) 
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Magnetic pressure 



Pm = 



2//q 



Alfven speed 



va = 



Bn 



VP^oQo 

Electron plasma frequency 






rrieeo 



Ion plasma frequency 



CJpi — 



Inoe^ 

rriieo 



B^ 



VA = 



Bo 



\/47r£>o 






nip 



I npe^ 

mi 



A. 3 Useful Relations 

Ax{BxC) = B{A ■ C) - C{A ■ B) 
{AxB)-{CxD) = {A- C){B ■D)-{A- D){B ■ C) 



A. 3.1 Vector Calculus 



d 



d 






VA 



dAa- dA„ 



dx 



+ 



dy 



+ 



dA, 



dz 



V X A — I — — — 

V X .,4. — I ay dz 

y Ax Ay Aj2 

V • (V X A) = 0 

V X (V X A) = V(V ■ A) - V^A 
V X V0 = 0 
V • v<^ = vV 

V-{AxB) = B-{VxB)-A-{VxB) 

V X (A X B) = A(V ■ B) - B(V • A) + (B • V)A - (A • V)B 



(A.16) 



(A.17) 



(A.18) 



(A.19) 



(A.20) 

(A.21) 



(A.22) 

(A.23) 

(A.24) 

(A.25) 

(A.26) 

(A.27) 

(A.28) 

(A.29) 

(A.30) 




436 Appendix 



V(A . B) = A X (V X jB) + B X (V X A) + (A-)B + {B • V)A (A.31) 

y is a volume enclosed by a surfac S. dS = endS is a vector normal to this 
surface. Then we have 



/ =V(l)dV= [ (l)dS 
Jv Js 



Divergence theorem (Gauss’s theorem): 



/ V • Ad^x - A dS 
Iv Js 

f V Tdy= [ T dS 



(A.32) 



(A.33) 



/ Vx AdF= dSxA 
Iv Js 



f V Tdy= [ T dS (A.34) 

Jv Js 

f WxAdV= [ dSx A (A.35) 

Jv Js 

f ((/)VV - /9V20) dF = [ {(f>Vp- pV(j))-dS (A.36) 

Jv Js 

(A-VxVxS-B-VxVxA) = [ (BxVxA-AxVxB).d5 (A.37) 

Js 



(A.35) 



(A.36) 



S is an open surface enclosed by a curve C with length element dl: 



Stokes’ law: 



dSxV(t)= (p dl(f) 

Js Jd 

[ {V X A) ‘dS= (f A- dl 
Js Jc 

[ {dSxV)xA = (f dlxA 
Js Jc 

/ {dS • (V0) X Vp = (p (f) dp — — (p pd(j) 
Js Jc Jc 



A. 3. 2 Cylindrical Coordinates 



V X A 



_ . Id... 1 dAo dAz 

r dr r dO dz 

V7A- 

\dr'rde'dz) 

1 dA^ dA$ dAr dAz 1 d{rAg) 1 dAr 

r 86 dz ' dz dr ’ r dr r 86 

V2^ I d^(f> d‘^(t> 

r \ r ) 86"^ dz'^ 



(A.38) 



(A.39) 

(A.40) 



(A.41) 



(A.42) 

(A.43) 



(A.44) 



(A.45) 



= VM 



2 dAe Ar 2 A I 
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A. 3. 3 Spherical Coordinates 



V • A = — — {r^Ar 



dr 






r sin 9 86 



^ I dA^ 

{Ae sin 6) + 



4 



rsin^ d(j) 






1 dp 



dr' r 86' rsmO dcp 



(A.47) 

(A.48) 



/ 1 d{Ad) sin 6) 1 dAe \ 

\rsin0 86 rsin6 d<t) ' ) 

/ 1 8Ar 1 8{rA(iy) 1 8{rAe) 1 8Ar 

\ r sin 6 d(p r dr ' r dr r 86 



(A.49) 



—— ( r^ — 
dr V dr 



1 d / ^dp 
r2 sin 61^ 



+ 



8‘^p 



sin^ 6 dcj)^ 



(A.50) 



A. 4 Useful Numbers 



A. 4.1 Fundamental Constants 



c 


speed of light in a vacuum 


2.998 X 10® m/s 


^0 


permittivity in a vaccum 


8.854 X 10-12 F/m 


1^0 


permeabilty in a vaccum 


47t X 10~^ Vs/Am 


e 


elementary charge 


1.602 X 10-19 C 


rrie 


electron mass 


9.109 X 10-31 j^g 




electron rest energy 


511 keV 


Trip 


proton mass 


1.673 X lO-^i" kg 




pront rest energy 


939 MeV 


mpjrrie 


proton-to-electron mass ratio 


1.836 X 10® 


a 


Stefan-Boltzmann number 


5.6708 X 10-® Jm-2s- 


kn 


Boltzmann number 


1.381 X 10-23 J/K 


eV 


electronvolt 


1.602 X 10-19 J 


eV/kB 


temperature equivalent for 1 eV 


1.160 X 10^ K 


G 


gravitational constant 


6.673 X 10-11 Nm2kg- 


h 


Planck’s constant 


6.623 X 10-34 Js 



A.4.2 Numbers in Plasmas 

In the following numerical expressions the plasma parameters have to be 
inserted in SI units [as long as no other units are indicated in squre brackets] : 
the electron density has to be inserted in m~^, B in Tesla, v± in m/s and 
the electron temperature Tq in K. The fundamental frequencies in plasmas 
then are given as: 
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cjpe electron plasma frequency 0.564 x rad/s 

cjpp proton plasma frequency 1.32 x y/n^ rad/s 

cjce electron cyclotron frequency 1.76 x 10^ ^5 rad/s = 28B[nT] Hz 

cjcp proton cyclotron frequency 9.58 x 10^ J3 rad/s = 0.01525B[nT] Hz 

These are angular frequencies. Frequencies can be obtained by division by 
27T. 

Important length scales and speeds in a plasma can be calculated according 
to the following rules: 



^Th,e 


electron thermal speedy ^/^2kpTe/m 


5.51 X 10^^/T^ m/s 


'^Th,i 


ion thermal speed, y/2kBT[/M[ 


129 X y/Ti • rUp/Mi m/s 


Aoe 


electron Debye length, ^/kBTe/m/uJpe 


69 X y/Te/ne m 


The 


electron Larmor radius, v±/ujce 


5.68 X m 


r*Lp 


proton Larmor radius, v±/ucp 


1.04 X m-^vx/B m 


Vs 


sound speed 


0.117VI; km/s 



A.4.3 Conversion of Units 

Occasionally, electrical units have to be converted into mechanical ones or vice 
versa. The following table gives some physical quantities in diflFerent units, 
which should help in unit conversion. It is neither complete nor systematic. 
Note that factors are not included, just different ways to express a unit! 

B magnetic field T kg s~^A“^ cm“^/^g^/^s~^ 

E electric field V/m m kg s“^A~^ cm~^/^g^/^s~^ 

I current A cm^/^g^/^s“^ 

Q electrical charge C As cm^/^g^/^s“^ 

Sp Poynting vector W/m^ kg s~^ g s“^ 



A. 5 Useful Internet Resources 

All information contained in the following list can be found via “clickable” 
links at www . physik . uni-osnabrueck . de/sotere/spacebook/intro . html. 



A. 5.1 Space Physics - General 

Some useful general resources for space physics are: 

• The Naval Research Laboratory has a general plasma physics page giving 
all relevant equations: 

wwwppd . nr 1 . navy . mil/nr If ormulary /nr If ormulary . html 

• Some collections of links to other space physics resources are provided by: 
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• Geospace Environment Data Analysis System: 

gedas22 . stelab . nagoya-u . ac . jp/website/website . html 

• the space science web page of the University of Iowa: 
www-pw . physics . uiowa . edu/links/space . html 

• links to space physics educational sites are provided by the Space Envi- 
ronment Center of NO A A at: 

WWW . sec . noaa . gov/Educat ion/ed_sites . html 

A. 5. 2 Textbooks and other Educational Material 

There are some textbooks on space physics available on the Internet: 

• The University of Oulu provides a space physics textbook at: 

WWW . oulu . f i/'^spaceweb/textbook/ 

• Bo Thiede from the University of Uppsala provides a textbook on classical 
electrodynamics at: www . plasma . uu . se/CED/Book/ 

Additional educational tools on the net include: 

• “What is Geospace” is an educational page by NASA: 

http : //www~ istp . gsf c . nasa . gov/istp/outreach/geospace . html 

• NOAA’s primer on space weather can be found at: 
http : // WWW . sel . noaa . gov/primer/primer . html 

• an exploration of the Earth’s Magnetosphere is possible with D.P. Stern 
and M. Peredo at: www-spof.gsfc.nasa.gov/Education/Intro.html 

• an encyclopedia of the atmospheric environment can be found at: 

WWW . doc . mmu . ac . uk/aric/eae/english . html 

• the Global Climate Change Student Guide is at 
WWW . doc . mmu . ac . uk/ aric/gccsg/ 

• a space physics tutorial from the Space Science Group of the University of 
California, Los Angeles, is provided at: 

www-ssc . igpp . ucla . edu/ ssc/ tutorial . html 



A. 5. 3 Missions 

Information about space missions can be obtained from the homepages of 
NASA and ESA. Homepages for some of the larger missions are listed below: 

• Solar and Heliospheric Observatory (SOHO) homepage at NASA: 
s oho WWW . nascom . nasa . gov 

• SOHO homepage at ESA: sohowww.estec.esa.nl 

• Homepage of the Extreme ultraviolet Imaging Telescope (EIT) on SOHO: 
umbra . gsf c . nasa . gov . eit 

• LASCO homepage: lasco-www.nrl.navy.mil/lasco.html 

• The TRACE homepages are lmsal.com/solarsites.html and vestige. 
Imsal . com/TRACE/POD/TRACEpodoverview . html 
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• Yohkoh homepage: www . Imsal . com/SXT 

• Ramaty High Energy Solar Spectroscopic Imager (RHESSI) homepage at: 
hesperia.gsfc.nasa.gov/hessi/ 

• STEREO stp.gsfc.nasa.gov/missions/stereo/stereo.htm 

• Advanced Composition Explorer (ACE): www.srl.caltech.edu/ACE/ 

• Wind: www-spof.gsfc.nasa.gov/istp/wind/ 

• SAMPEX (Solar, Anomalous, and Magnetospheric Particle EXplorer): 
sunland . gsf c . nasa . gov/smex/sampex/ and surya . umd . edu/www/sampex . 
html 

• Ulysses: ulysses.jpl.nasa.gov/ 

• Interplanetary Monitoring Platform (IMP-8): 
nssdc . gsf c . nasa . gov/space/imp-8 . html 

• IMAGER (Imager for Magnetopause to Aurora Global Explorations): 
image . gsf c . nasa . gov/ 

• POLAR: http://www-spof.gsfc.nasa.gov/istp/polar/ 

• CLUSTER-II: sci.esa.int/science-e/www/area/ index. cfm?fareaid= 

8 

• Oersted/ CHAMP: web . dmi . dk/pro j ects/oersted/ and op . gf z-potsdam . 
de/ champ/ index_CHAMP . html 

• Acompilation of links to solar and space physics missions is given at: 
members . aol . com/gca7sky /missions . htm 



A. 5.4 The Sun 

• General information concerning the Sun and a large number of useful links 
to other sites are given by Bill Arnett on: 

seds . Ipl . arizona.edu/nineplanets/nineplanets/sol . html 

• “The Sun: a pictorial introduction” is provided by the High Altitude Ob- 
servatory (HAO) at WWW . hao . near . edu/public/slides/slides . html 

• Big Bear Solar Observatory: www . bbso . nj it . edu 

A. 5. 5 Solar-Terrestrial Relationships 

• Space Environment Center: www . sel . noaa . gov/ index . html 

• NOAA’s space weather: www . spaceweather . noaa . gov/ 

• ESA’s space weather page: www.estec.esa.nl/wmwww/wma/spweather/ 

• NASA’s Sun Earth Connections page: sec.gsfc.nasa.gov/ 

• Windows to the Universe: Basic Facts about Space Weather: 

WWW . windows . ucar . edu/ spaceweather/basic\_f act s . html 

• Lund Space Weather Center: www.lund.irf.se/ 

• Center for Integrated Space Weather Modeling (CISM): 

WWW . bu . edu/ cism/ 

• Space weather resources compiled by Rice University: 
space . rice . edu/ISTP/ 
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• Space weather resources at NSSDC: 

spdf .gsfc.nasa.gOv/space_weather/Space_Weather_at_SSD00.html 



A. 5. 6 Aurorae 

Some useful links to pictures and general aspects of aurorae are 

• Poker Flat Research Range, Fairbanks, Alaska: 

WWW . pf rr . alaska . edu/aurora/INDEX . HIM 

• University of Michigan aurora page: www.geo.mtu.edu/weather/aurora/ 

• Norwegian Northern Light page: www.northern-lights.no/ 

• Jan Curtis aurora page, probably the best photographs on the Web: 
www.geo.mtu.edu/weather/aurora/images/aurora/jan.curtis/ and the 
more recent version: climate . gi . alaska.edu/Curtis/curtis.html 

• Polar image page, also covers noctilucent clouds: www.polarimage.fi 

Predictions of aurorae from particle measurements in space can be found at 
sec . noaa . gov/pmap/. 



A. 5. 7 Data 

• Space Physics Interactive Data Resource (SPIDR) at the National Geo- 
physical Data Center: spidr.ngdc.noaa.gov/spidr/index.html 

• Solar Data Analysis Center at Goddard Space Flight Center: 
umbra . nascom . nasa . goc . sdac . html 

• Solar and Upper Atmospheric Data Services: 

WWW . ngdc . noaa . gov/SOLAR/solar . html 

• National Oceanographic and Atmospheric Administration (NOAA): 

WWW . SpaceWeather . com 
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solar-terrestrial relationships 11 

sound speed 96, 102, 198 

sound wave 102 

source surface 147 

South Atlantic Anomaly 285, 342 

Sporer’s law 173 

speciflc heat ratio 56 

spiral angle 229, 274 

sprite 395 

sputtering 370 

Sq variation 286, 323 

sq-current system 306 

sq- variation 306 
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St0rmer orbits 349 
St0rmer unit 352 
stand-off distance 294 
standard deviation 225 
standard QLT 239, 262 
standard solar model 138 
standing shock 196, 199 
starsphere 7 

Stefan-Boltzmann law 136, 143, 170 
stellar activity 382 
STEREO 187 

stochastic acceleration 242,245,251, 
252, 257, 258, 262, 263 
stop band 108 
stopping height 387 
stratosphere 300 
stream function 

- magnetic 71 
streamer belt 146, 150, 268 
streaming 

- diffusive 226 
streaming distribution 117 
stress tensor 52 
subsolar point 265, 293, 294 
substorm 235, 324 

- Mercury 359 
substorm current 309 
substorm current wedge 320 
Sudbury Neutrino Observatory 139 
sudden commencement 324, 325 
sudden ionospheric disturbance (SID) 

304, 352, 377, 401 
sunspot 4, 62, 170, 174 
sunspot cycle 12 
sunspot number 171 
sunspot relative number 171 
Super-Kamiokande 138 
sympathetic coronal mass ejection 
179 

sympathetic flare 179 

tachocline 140, 173 
tail current 296, 312 
tangential discontinuity 203, 313 
Taylor column 291 
temperature 

- effective 136 
tensor 52 
terella 13 



termination shock 154, 214 
thermal conductivity 73 
thermal energy 5 
thermal speed 101, 115 
thermosphere 302 

- circulation 393 
theta aurora 322, 339 

tilt angle 161, 163, 273, 276 
time to maximum 227 
time-of- flight spectrometer 415 
TRACE 148, 157 
transformation 

- electromagnetic field 20 
transition region 142 
transport equation 275 
travelling shock 196, 199 
tropopause 300 
troposphere 300, 380 
turbulence 

- Alfvenic 166 

- fast solar wind 167 

- slow solar wind 167 
turbulent conductivity 83 
turbulent dissipation time 83 

Ulysses 168, 272 

- magnetic field 168 

- solar wind 169 

- trajectory 168 
umbra 170 

upper hybrid frequency 104 
upper hybrid oscillation 103 
upstream turbulence 256 

viscosity 53 

- collisionless 53 

- kinematic 52, 53 

- magnetic 69 
viscosity coefficient 53 
Vlasov equation 120, 231 
von Humboldt 13 
vorticity area index 398 
Voyager 271,272 

wave 

- acoustic 183 

- Alfven 94, 156, 165, 166, 170, 233, 
324 

- compressive 97 
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- electromagnetic 105 

- electron plasma 101 

- electrostatic 99 

- electrostatic ion 104 

- extraordinary 108 

- fast 97 

- ion 102 

- ion acoustic 102, 166 

- ion cyclotron 104 

- ion-cyclotron 166 

- Langmuir 101 

- lower hybrid 105 

- magneto- acoustic 183 

- magneto-sonic 94, 97 

- MHD 94, 97, 156 

- Moreton 183 

- ordinary 108 

- plasma 101 

- R and L 109 

- slow magneto-sonic 97 



- sound 102 

- ultra- low-frequency (ULF) 324 

- upper hybrid 104 

- Whistler 110, 166, 235, 310 
wave vector 90 

wave-particle interaction 10, 230 
Weber 12 

Whistler 109, 110, 166, 234, 235, 310, 
361 

Wien filter 30 
Wien’s law 143 
WIND 149 
Wolf 13 

Wolf number 171 

X-point 75, 189, 296, 313, 318 

Zeeman effect 146, 409 
zodiacal light 145 




